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Give soils their due 


W e are not paying enough attention to the 
world’s soils, a “nearly forgotten resource” 
and our “silent ally,” 33% of which are in a 
state of degradation.* We can’t breathe, eat, 
drink, or be healthy without sustainably 
managing soils. So in recognizing 2015 as 
the International Year of Soils, the United 
Nations (UN) is focusing global attention on the increas- 
ing pressures on soils and their ripple effect on other 
global challenges. 

A major concern is whether soils will support the grow- 
ing demand for food. Human 
activities have transformed 
soils, lands, and regions, 
with long-lasting effects 
that include desertification, 
decreased organic matter 
in soils, altered biodiversity, 
and changed biogeochemical 
and hydrological cycles. As a 
result, the land available for 
food production is shrink- 
ing, irreversibly in some 
cases. Converting cropland 
to biofuel systems and urban 
centers is having the same 
effect. Agricultural practices 
have increased soil erosion 
to rates much greater than 
those of soil formation (it 
can take up to 1000 years 
to form 1 cm of soil). The 
mismanagement of soil re- 
sources is exacerbating these 
assaults on the food supply. 

The good news is that there is a mandate to improve 
soil management. With soil security now a priority, sev- 
eral Draft UN Sustainable Development Goals (2016 to 
2030) directly and indirectly involve soils. Goals 2 and 
15, for example, target sustainable food production and 
the use of lands. Nations also are addressing the major 
consequences of the misuse and mismanagement of soils 
through focused agendas: degraded land (UN Convention 
to Combat Desertification), loss of biodiversity and eco- 
system services (UN Convention on Biological Diversity), 
and increased greenhouse gas emissions (UN Framework 
Convention on Climate Change). Further, the Global Soil 
Partnership— a voluntary partnership between national 
governments, concerned stakeholders (including univer- 
sities, industry, and landowners), and nongovernment 
organizations— is facilitating collaborations among these 


conventions. Its achievements are promising: An Inter- 
governmental Technical Panel on Soils is now advising on 
global soil issues. 

Exploration of soil’s unique habitats reveals numerous 
microbes and invertebrates that contribute to life-sustain- 
ing services such as cleansing water, regulating pests, and 
cycling nutrients. Connections between different soil bi- 
ota can be severed through mismanagement of all lands: 
cities, forests, deserts, grasslands, and agricultural fields. 
Hence, this connectedness extends to bonds between soil 
biota and humans, and we must improve the function- 
ing of soil biota as part of our 
long-term commitment to a 
sustainable future. 

A holistic management 
approach to soils requires 
understanding that human 
health depends on nonde- 
graded soils not only for 
food but for clean air and 
water. Air pollutants derived 
from disturbed soils include 
volatile organic compounds, 
greenhouse gases, dust, and 
biota. These are transported 
by wind for hundreds to 
thousands of miles. The im- 
pact of such mobility by po- 
tential pathogens such as 
parasitic worms on plants, 
animals, and humans is be- 
coming clearer. Because soils 
are also one of the largest 
stores of carbon that is in 
direct exchange with the at- 
mosphere, soil degradation negatively affects society via 
climate change feedbacks. The water we drink depends on 
maintaining soils that store, filter, and cleanse water. Al- 
though the soil-clean air-clean water-human health link- 
age has led to air and water regulations, they do not address 
the cause of the impacts: the mismanagement of soil. 

The 2015 International Year of Soils is an occasion 
to celebrate and raise awareness of Earth’s soil and its 
functions for humanity. As we pave fertile soils for cities, 
expand agriculture into marginal lands such as polar re- 
gions and deserts, and address climate change impacts of 
droughts and floods, we should consider the benefits that 
managing soils provides for multiple global environmen- 
tal issues. As US. President Franklin D. Roosevelt said, 
“A nation that destroys its soils destroys itself.” 

- Diana H. Wall and Johan Six 



“Wfe can’t breathe, eat, drink, or 
be healthy without sustainably 
managing soils.” 
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It is one thing to debate the merits of a 
theory. It is quite another to impugn a person’s 
character with innuendos. 59 

A 5 February British Columbia Supreme Court ruling that Canada’s National 
Post defamed climate scientist Andrew Weaver and must retract several articles. 



A little brown bat with 
white-nose syndrome hangs 
in Vermont’s Greeley Mine. 


Fungus has decimated bat colonies 

n just 7 years, a fungal disease known as white-nose syn- 
drome has killed more than 5 million North American 
bats, nearly wiping out entire colonies. Initially identi- 
fied by a white fungus growing on bats’ noses, the dis- 
ease drains hibernating bats of their energy reserves. 
Now, a study in Global Ecology and Biogeography takes 
a closer look at how the size of a bat colony affects the like- 
lihood of local extinction. They studied more than 1100 
winter colonies of bats in North America, poring over 4 de- 
cades of population counts between 1976 and 2013. Since 
its discovery in North America in 2006, the disease has 
reduced populations of North American bats in the colo- 
nies by 60% to 98%, the researchers report. For five out of 
six species of hibernating bats studied in eastern North 
America, larger winter colonies helped insulate against lo- 
cal extinction— but for the sixth and most affected species, 
the northern long-eared bat, extinction risk was constant 
regardless of colony size. Resulting increases in mosqui- 
toes and agricultural pests could lead to damaged crops 
and increased spread of human diseases. 


AROUND THE WORLD 

A new name for chronic fatigue 

WASHINGTON, D.c. | A Committee Con- 
vened by the Institute of Medicine (lOM) 
has proposed a new name for a condi- 
tion known as chronic fatigue syndrome 
or myalgic encephalomyelitis: systemic 
exertion intolerance disease (SEID). After 
reviewing thousands of studies, expert 
testimony, and public input, the committee 
concluded that “the name ‘chronic fatigue 
S3mdrome’ has done a disservice to many 
patients” as it was “stigmatizing and trivi- 
alizing,” while myalgic encephalomyelitis 
“does not accurately describe the major 
features of the disease.” In a 235-page 
report released on 10 February, the com- 
mittee also suggested diagnostic criteria 
for SEID that focus on “central S 5 miptoms” 
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such as reduced ability to work and study 
and “unrefreshing” sleep. Criteria used by 
the U.S. Centers for Disease Control and 
Prevention were deemed “overly inclusive, 
particularly of patients whose S3miptoms 
may be caused by a psychiatric disorder.” 
An estimated 836,000 to 2.5 million 
Americans have these disorders, but lOM 
notes that fewer people will meet the new, 
stricter criteria. http://scim.ag/chronSEID 

New light source shines 

UPTON, NEW YORK | On 6 February 
Department of Energy (DOE) Secretary 
Ernest Moniz dedicated the new $912 mil- 
lion National S 3 mchrotron Light Source II 
(NSLS-II) at DOF’s Brookhaven National 
Laboratory in Upton, New York. The 
NSLS-II will be the brightest synchrotron 


light source in the United States and— 
within a certain energy range— the world. 
NSLS-II research “will probe the funda- 
mental structure of novel materials and 
help drive the development of low-cost, 
low-carbon energy technologies, spark 
advances in environmental science, and 
spur medical breakthroughs,” Moniz said. 
The NSLS-II, which will be 10,000 times 
as bright as its predecessor, will produce 
extremely intense beams of x-ray, ultra- 
violet, and infrared light. It will allow 
researchers to probe the properties of 
materials at resolutions approaching 
10 nanometers. http://scim.ag/NSLSII 

NIH old scientist award panned 

BETHESDA, MARYLAND | Au idea from the 
National Institutes of Health (NIH) to nudge 
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aging scientists to retire is being blasted in 
the blogosphere. The potential “emeritus 
award,” described in a 3 February notice 
and NIH’s Rock Talk blog, would allow 
senior investigators to wind down their labs 
or transfer the work to a junior colleague. 
Many of the more than 140 comments on 
the blog post argue that such an award is 
unnecessary and could take funding away 
from younger researchers. NIH Deputy 
Director for Extramural Research Sally 
Rockey says she’s not surprised by the nega- 
tive responses: “In tight budget times, any 
proposed new award creates angst that it 
will have an impact on the rest of the pool” 
of investigators. Rockey encourages critics to 
submit formal comments, http://scim.ag/oldsci 

Stem cell odd couple team up 

SEOUL I Woo Suk Hwang, who fraudu- 
lently claimed to have created embryonic 
stem cells matched to human patients, 
and Shoukhrat Mitalipov, who really did 
it, will conduct joint research, reported 
South Korea’s Dong-A Ilbo newspaper this 
week. In 2006, Hwang retracted two papers 
published in Science and was later convicted 
for embezzling research funds and bioethics 
violations. Mitalipov, of the Oregon Health 
& Science University in Portland, reported 
in May 2013 that he had derived stem cells 
from cloned human embryos. Hwang told 
the newspaper that he, Mitalipov, and 
Xiaochun Xu, CEO of Boyalife Group in 
Wuxi, China, will jointly work on cloning 
mechanisms, with an eye to curing inherited 
mitochondrial disease. Boyalife, a Chinese 
regenerative medicine company, will put up 
about $93 million, http://scim.ag/stemodd 

DOE scraps carbon capture plant 

WASHINGTON, D.c. | The US. Department 
of Energy (DOE) has again pulled out 
of FutureGen, the troubled project to 
create the first commercial-scale power 
plant in the United States that captures 
and sequesters CO^ emissions. Originally 
conceived under President George W. Bush 
in 2003, the project was previously aban- 
> doned by the US. government in 2008 due 
J to rising costs. In 2010, President Barack 
I Obama earmarked $1 billion for FutureGen 
I 2.0, a $1.7 billion revised version that 
I would retrofit a coal-fired power plant in 
y Illinois. The stimulus money would have 
I to be spent by September 2015. However, 

^ permitting delays and legal challenges 
I by environmental groups have slowed its 
% progress, and on 3 February, FutureGen 
I Alliance CEO Ken Humphreys announced 
^ that “the DOE has concluded that there is 
d insufficient time to complete the project.” 



Bolivia’s Potosf silver mine, shown here in 1884, produced significant air pollution. 


Anthropocene started earlier in the Andes 

D ust and metals preserved in an ice core from the Queiccaya Ice Cap high in 
the Peruvian Andes indicate that humans began polluting the region centuries 
before the industrial revolution arrived— and suggest that the Anthropocene, 
a geological epoch defined by humans’ influence on the planet, began at 
different times around the world. The ice core records centuries of South 
American air pollution from mining during precolonial times through 1989, researchers 
report online this week in the Proceedings of the National Academy of Sciences. Air 
pollution really took off when the Spanish colonized South America in the 16th century: 
one major culprit was likely a gigantic silver mine in Potosf, Bolivia. Lead levels in the 
ice core nearly doubled between 1450 C.E. and 1900 C.E. Although most of South 
America’s air pollution was released in the 20th century, colonial mines like Potosf 
had such a dramatic impact on the environment— 240 years before the industrial 
revolution— that they should be considered the beginning of the Anthropocene in the 
region, the researchers say. http://scim.ag/Andesmine 
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$1.09 

billion 

Amount of money paid, as of 
31 December 2014, to support the 
international response to the Ebola 
outbreak, or 37.7% of the $2.89 bil- 
lion pledged by donors. 

250 

Percent by which total energy used 
by consumer electronics rose in 
an average U.S. household from 
1992 to 2007, according to a study in 
Environmental Science & Technology. 



Shattered chromosome cure 

BETHESDA, MARYLAND | A gM who grew 
up with a serious genetic immune disease 
was apparently cured in her 30s by one of 
her chromosomes shattering into pieces 
and reassembling. As a 9-year-old, the 
woman had the first known case of WHIM 
syndrome, which results in warts and 
frequent infections from low levels of cer- 
tain white blood cells. Her two daughters 
developed the same rare disease through 
a fault in the CXCR4 gene, but the woman, 
now 59, no longer has symptoms. National 
Institutes of Health (NIH) researchers 
traced her improvement to chromothripsis, 
a phenomenon discovered 4 years ago in 
which a chromosome somehow shatters 
during cell division, then reforms with 
the pieces scrambled. Although this can 
contribute to cancer, it seems to have 
cured the woman by deleting her flawed 
copy of CXCR4 from a blood stem cell, 
the NIH team reported in Cell. 
http://scim.ag/shatterchrom 


Push to protect farm animals 

WASHINGTON, D.c. | The U.S. Congrcss 
last week proposed new protections for 
farm animals used in scientific research. 
The move comes in response to an expose 
published in The New York Times last 
month, which documented numerous 
cases of animal suffering and death at a 
Department of Agriculture facility that 
has been trjdng to create larger and more 
fecund farm animals for several decades. 
Lawmakers from both parties are backing 
a bill— called the AWARE Act— that would 
expand the scope of the Animal Welfare 
Act, which governs the humane treatment 
of laboratory animals. Farm animals are 
currently excluded from the act, unless 
they’re used in biomedical research or 
exhibition. The new law would require 
closer monitoring— and more inspections— 
of research involving cows, pigs, and other 
livestock, http://scim.ag/farmani 


NEWSMAKERS 



Head of FDA to step down 

After serving nearly 6 years as head of 
the U.S. Food and Drug Administration 
(FDA), Margaret Hamburg revealed last 
week that she will step down at the end of 
March. Her e-mail announcement to FDA 
staff, later posted on the agency’s website, 
provided little detail about the decision. 
The departure comes in the middle of a 
major overhaul of the agency’s food safety 
oversight and as legislators prepare 
to propose key changes to its medical 
product review and approval process. 

FDA Chief Scientist Stephen Ostroff will 
fill her position until President Barack 
Obama appoints a new commissioner. 
Duke University cardiologist Robert Califf, 
who will step in as deputy commissioner 
later this month, is rumored to be a 
likely successor. 


Rubble in L’Aquila, Italy, in April 2009, days after an earthquake killed more than 300 people. 


Why Italian earthquake scientists were exonerated 

S ix scientists convicted of manslaughter in 2012 for advice they gave ahead of 
the deadly LAquila earthquake 3.5 years earlier were victims of “uncertain and 
fallacious” reasoning, found three judges who acquitted the experts in an appeal 
trial last November. The judges also reduced the sentence of a seventh defen- 
dant, a public official, from 6 years to 2 years. In a 389-page document deposited 
in court 6 February and since released to the public, the magistrates accepted the 
controversial idea that officially sanctioned reassurances were decisive in causing 
some of the quake victims to stay indoors— but ruled that those reassurances were the 
exclusive fault of the public official, at the time deputy head of Italy’s Civil Protection 
Department, and no blame can be attributed to the scientists. http://scim.ag/LAquilaapp 
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BIOTECHNOLOGY 

Afficidtuml researchers rattled 
by demands for documents 

Group opposed to GM foods asks a dozen scientists to hand 
over letters, e-mails in probe of academic-industry ties 


By Keith Kloor 

T he fierce public relations war over 
genetically modified (GM) food has a 
new front. A nonprofit group opposed 
to GM products filed a flurry of free- 
dom of information requests late last 
month with at least four U.S. universi- 
ties, asking administrators to turn over any 
correspondence between a dozen academic 
researchers and a handful of agricultural 
companies, trade groups, and PR firms. The 
scientists— many of whom have publicly sup- 
ported agricultural biotechnologies— are de- 
bating how best to respond, and at least one 
university has already rejected the request. 

^ “It seems like a fishing expedition to me,” 
I says geneticist Alison Van Eenennaam of the 
I University of California (UC), Davis, one of 
^ six UC researchers targeted by the requests. 
I “I am very worried [the correspondence] is 
§ going to be used to sully the reputations of 
g scientists.” The tactic is familiar in another 
I controversial area, climate science, where 
I researchers have faced an avalanche of docu- 
g ment requests from climate change skeptics. 
i The group, U.S. Right to Know (USRTK) of 


Oakland, California, says it has no vendetta. 
It has targeted only researchers who have 
written articles posted on GMO Answers, 
a website backed by food and biotechnol- 
ogy firms, and work in states with laws that 
require public institutions to share many 
internal documents on request. Executive 
Director Gary Ruskin told Science. USRTK 
is interested in documenting links between 
universities and business, he says, and is “es- 
pecially looking to learn how these faculty 
members have been appropriated into the 
PR machine for the chemical-agro industry.” 

Ruskin is no stranger to the GM food de- 
bate. He helped manage an unsuccessful 
2012 effort to pass a California ballot initia- 
tive requiring the labeling of food products 
containing GM ingredients. Late last year, he 
helped found USRTK, which works “to ex- 
pose what the food industry doesn’t want us 
to know. ... We stand up for the right to know 
what is in our food and how it affects our 
health.” The group’s three board members 
include Juliet Schor, a prominent economist 
at Boston College. USRTK’s website says its 
sole major donor (more than $5000) is the 
Organic Consumers Association, a nonprofit 


GM food opponents, like these in Los Angeles, 
are adopting new strategies that put academics 
on the spot. 

group based in Finland, Minnesota. 

In the requests, Ruskin seeks all letters 
and e-mails exchanged after 2012 between 
the scientists and 14 companies and groups. 
The list includes Monsanto, Syngenta, Du- 
Pont, Dow, major biotech and grocery trade 
groups, and communications firms including 
FleishmanHillard and Ogilvy & Mather. “The 
records disclosed ... will be used in prepara- 
tion of articles for dissemination to the pub- 
lic,” states one request obtained by Science. 

Many researchers are awaiting advice 
from university lawyers on how to re- 
spond. Kevin Folta, a biologist and biotech 
researcher at the University of Florida in 
Gainesville, would like to comply. But he an- 
ticipates trouble. “Unfortunately, when you 
skim through the 70,000 e-mails I have ... 
[USRTK] will find opportunities to pull out a 
sentence and use it against me,” he predicts. 
“They will show I have 200 e-mails from big 
ag companies. While it is former students 
... or chitchat about someone’s kids, it won’t 
matter. They’ll report, ‘Kevin Folta had 200 e- 
mails with Monsanto and Syngenta,’ as a way 
to smear me.” 

USRTK has asked food allergy researcher 
Richard Goodman, a former Monsanto em- 
ployee who has been at the University of 
Nebraska, Lincoln, since 2004, for any cor- 
respondence with his old firm related to a 
controversial study led by biologist Gilles- 
Eric Seralini of the University of Caen in 
France. The study, which claimed that 
foods caused health problems in rats, was 
published in Food and Chemical Toxicol- 
ogy in 2012 but was withdrawn in 2013, the 
same year Goodman became an associate 
editor of the journal. 

Toxicologist Bruce Chassy who retired in 
2012 from the University of Illinois, Urbana- 
Champaign, understands why he is a target. 
“I suspect a disclosure would make me look 
bad,” he says, noting he regularly interacts 
with firms that produce GM products and 
has urged them to do more to answer the 
technology’s critics. But the school’s lawyers 
rejected USRTK’s request on 4 February, not- 
ing Chassy no longer works at the university. 

USRTK says its requests are designed 
to promote transparency in a controver- 
sial research arena. But some researchers 
worry they will also have a chilling effect 
on academic freedom. “Your first inclina- 
tion ... is to stop talking about the subject,” 
Van Eenennaam says. “But that’s what they 
want. And I don’t want to be intimidated.” ■ 


Keith Kloor is a freelance journalist living 
in New York City. 
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ENDANGERED SPECIES 

Captive pandas succumb to killer virus 

Deaths at breeding center in China have put scientists on edge 


By Mara Hvistendahl 

» fatal virus is felling pandas at a breed- 
ing center in China, raising questions 
about management of the iconic en- 
dangered species and prompting the 
State Forestry Administration (SFA) 
to institute strict quarantine and dis- 
ease control measures. 

The outbreak started with the death of 
an adult female at the Shaanxi Rare Wild- 
life Rescue and Breeding Research Center, 
near Xi’an, in early December. Chinese state 
media report that three more pandas have 
died— the latest on 4 February— and a fifth is 
stable. Officials are blaming canine distem- 
per, a highly contagious virus often spread by 
dogs that in recent years has taken a heavy 
toll on lions, seals, and other mammals. 

There is no sign that the virus has spread 
to other reserves or to the wild pandas on 
the other side of the Qin Mountains from 
the Shaanxi center, which housed 25 pan- 
das before the outbreak. Still, the cases 
have rattled panda researchers at a time 
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when they are still waiting for the release of 
figures from China’s fourth national panda 
survey, expected months ago. The deaths 
have also exposed concerns that China is 
overemphasizing captive breeding at the 
expense of conserving the animals’ remain- 
ing wild habitat in the mountains of west- 
ern China, where the last survey, in 2002, 
tallied some 1600 pandas. 

It’s unclear whether panda handling 
practices contributed to the outbreak in 
Xi’an. Chinese press reports suggest that 
the breeding center, which also houses 
other rare animals, is close to a residen- 
tial area where villagers keep unvaccinated 
dogs. Although the virus can be deadly in 
animals— a 1994 outbreak killed an esti- 
mated 30% of all wild lions in Serengeti 
National Park— it is preventable in captive 
pandas, says Kati Loeffler, a veterinarian 
with the International Fund for Animal 
Welfare in Yarmouth Port, Massachusetts. 
Loeffler, who is the former director of ani- 
mal health at the Chengdu Research Base 
of Giant Panda Breeding in China, says the 


Chengdu center has used a recombinant ca- 
nine distemper vaccine that worked well in 
pandas in the past. But it is unclear to what 
extent smaller centers like the Shaanxi one 
use the vaccine. 

To avoid more deaths, the center’s four 
remaining sick pandas and 17 others re- 
portedly are in quarantine. Because people 
can carry distemper virus without show- 
ing symptoms, China’s SFA on 9 January 
banned tourists from close contact with all 
endangered wildlife and instituted health 
checks for employees of breeding facili- 
ties. The authorities also ordered repairs 
on fences around breeding areas to prevent 
dogs and other animals from entering. 

The deaths are a setback for China’s boom- 
ing breeding program, which generates in- 
come from tourism and loaning pandas to 
foreign zoos for large fees. In late December, 
China had 394 pandas in captivity, accord- 
ing to SFA— more than twice the number 
in 2003 and well above a target of 300 rec- 
ommended by the International Union for 
Conservation of Nature for maintaining 90% 
of genetic diversity in the captive 
population for 100 years. (China’s 
goal is now 500 animals.) 

The rise in captive numbers has 
been achieved mainly through 
artificial insemination. Interna- 
tional scientists have advised on 
the breeding process, even help- 
ing centers pair animals for op- 
timal genetic health, but some 
practices that are frowned on at 
foreign zoos persist. For example, 
cubs in the wild stay with their 
mothers until about 18 months, 
but Chinese centers often sepa- 
rate them prematurely so that the 
mothers can breed year after year, 
Loeffler says. The breeding pro- 
gram is “completely geared to- 
ward producing numbers of cubs, 
with no regard for the overall well- 
being of individual animals,” she 
adds. “It’s no different from your 
basic pig or chicken industry.” 

David Wildt, head of the Center § 
for Species Survival at the Smith- g 
sonian Conservation Biology In- < 
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To halt the spread of distemper to | 

pandas, China has banned encounters g 
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INFECTIOUS DISEASES 


Ebola drug trials lurch ahead 

Changing epidemiology and hints of success alter studies 
of experimental treatments 


stitute in Front Royal, Virginia, disagrees. 
Although abuses may have happened at 
some Chinese breeding centers in the past, 
he says, there has been a “shift in philoso- 
phy” that has put “less focus on quantity of 
giant panda cubs and more on quality” 

But some outsiders are critical of another 
practice: taking female cubs from the wild 
to replenish breeding programs. Two of the 
females that died at the Shaanxi center had 
been taken into captivity as 1-year-olds. The 
center claims that the animals were injured, 
but Wang Dajun, a wildlife biologist with 
Peking University’s School of Life Sciences 
in Beijing, says that this is often a pretext 
for replenishing breeding center popula- 
tions with healthy cubs. 

Larger centers have experimented with 
reintroducing captive-bred adults into the 
wild, targeting reserves with small, isolated 
panda populations. “This could increase 
genetic diversity and population size,” says 
Wei Fuwen, a conservation biologist with 
the Institute of Zoology in Beijing. Oth- 
ers say, however, that habitat conservation 
should be the top priority. Pandas migrate 
as the bamboo they eat matures or flow- 
ers, says Colby Loucks, deputy director of 
the wildlife conservation program at WWF 
in Washington, D.C. But because of human 
settlements on the lower elevations of the 
pandas’ habitat, he says, “Pandas are some- 
what boxed in at this point.” 

The Chinese government has established 
habitat corridors connecting some of the 
65 panda reserves, which cover 85% of 
the panda’s natural habitat. “The wildlife 
habitat has been recovered a lot in the core 
area,” Wang says, though he adds that re- 
searchers don’t yet know by exactly how 
much. But in China, as in other countries, 
some reserves are “paper parks,” says Sarah 
Bexell, director of conservation education 
at the Chengdu breeding center. 

The fourth survey will shed light on the 
quality of the protected habitat, Wildt says. 
It includes data on the amount and types 
of bamboo on reserves, which should help 
scientists determine whether they can ac- 
commodate more reintroduced pandas. 

Like the 2002 census, this one tallied the 
population by counting pieces of bamboo 
found in panda feces. The assumption is that 
individual pandas take bites of a character- 
istic size, allowing researchers to avoid du- 
plicating animals in their count. This time 
around, the survey will also rely on analysis 
of fecal DNA to verify some of the findings, 
say scientists familiar with the effort. 

1 It’s unclear when the long-delayed results 

2 will be released; Chinese authorities now 
^ seem focused on getting the canine distem- 
I per outbreak under control. Says Bexell: 
I “We’re all sitting on the edge of our seats.” ■ 
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By Kai Kupferschmidt and Jon Cohen 

E ven the researchers whose trial of a po- 
tential drug for Ebola made headlines 
last week worked hard to downplay 
the glimmer of efficacy it showed. “It 
is a weak signal in a nonrandomized 
trial,” Yves Levy, director of the French 
Institute of Health and Medical Research 
(INSERM) in Paris told Science about the 
data, which INSERM has not released. Weak 
or not, the report in The New York Times that 
favipiravir, a Japanese flu drug, had halved 
mortality in one group of Ebola patients in 
Guinea was one more piece of good news 
that is complicating prospects for trials of 
other Ebola drugs. 

The Guinean government has already an- 
nounced it wants to make favipiravir avail- 
able to more people, and if the results hold 
up to greater scrutiny, they could force a 
change in the design of other clinical tri- 
als going forward. Meanwhile, the decline 
in new cases has investigators revamping 
trials at a time when manufacturers finally 
have enough supplies to test some of the 
most promising experimental drugs. 

The toll of the outbreak ticked up 
last week, as Guinea, Liberia, and Sierra 
Leone— the three most affected countries— 
counted 124 confirmed cases, up from 
99 cases the week before. As the World 
Health Organization’s (WHO’s) Bruce Ayl- 
ward said at a press conference on 5 Febru- 
ary: “The virus has told us this week, loud 
and clear, T am not going to go away the 
way you’re expecting me to.’ ” Yet the num- 
bers represent a sharp drop from the height 


of the epidemic in September when more 
than 700 cases were reported in a single 
week in West Africa. 

Just last week, the Wellcome Trust, a cha- 
rity in the United Kingdom that is funding 
several Ebola trials, announced that the 
Liberian trial of brincidofovir, an antiviral 
developed by Chimerix of Durham, North 
Carolina, would be canceled because the 
company withdrew support. “It was rather a 
surprise to us and a bit of a mystery,” says 
Peter Horby, an investigator at the Univer- 
sity of Oxford in the United Kingdom who 
headed the study. Chimerix said it made the 
decision after discussions with the US. Food 
and Drug Administration (FDA), noting that 
the trial was also having trouble recruiting 
patients. Horby says his group was planning 
to open a second trial site in Sierra Leone, 
where new cases are far higher. Chimerix 
said it was concentrating on completing tri- 
als of the drug to treat other infections: cyto- 
megalovirus and adenovirus. (FDA’s Luciana 
Borio says Chimerix refused the agency’s 
request to make public its correspondence 
with the company.) 

Although one trial is canceled, others are 
about to go forward. Horby says his group 
hopes to start evaluating an RNA inhibitor 
called TKM-Ebola soon. The drug, made by 
Tekmira Pharmaceuticals of Burnaby, Can- 
ada, worked well in monkeys but has been 
in short supply. 

Testing is also about to begin on the anti- 
body cocktail ZMapp. Seen by many research- 
ers as the best shot at treating Ebola because 
of promising monkey studies, ZMapp was 
used on nine patients last summer before the 
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company behind it, Mapp Biopharmaceutical 
of San Diego, California, announced it had no 
more supplies. Now there are enough doses 
to start a clinical trial in Liberia as early as 
this week. But there may be too few patients 
in that country for the experimental drug to 
prove its worth, says Clifford Lane, head of 
clinical research at the U.S. National Institute 
of Allergy and Infectious Diseases, which is 
launching the study in Monrovia with the Li- 
berian Ministry of Health & Social Welfare. 

So far, Guinea and Sierra Leone, where 
Ebola is still infecting dozens of people a 
week, have refused invitations to join the 
study. Their main stumbling block is trial de- 
sign. ZMapp will be the first Ebola treatment 
that will be tested against a placebo control. 
“I think that’s the only way to tell whether 
these drugs are safe and effective,” Lane says. 

The governments of Guinea and Sierra 
Leone, as well as Doctors Without Borders, 
which runs Ebola centers in those coun- 
tries, have for ethical reasons been reluc- 
tant to participate in treatment trials that 
use a placebo. Jeremy Farrar, head of the 
Wellcome Trust, also objects to the random- 
ized, controlled trial design for Ebola drugs, 
given the high mortality rate of the disease. 
“Given the data we have from animals and 
individual patients, I would not feel com- 
fortable being randomized to ZMapp or a 
placebo,” he says. 

Lane notes that the trial may not need 
many participants: If the drug is 100% ef- 
fective and Ebola kills 50% of the people it 
infects, he says, as few as 30 people will need 
to receive ZMapp to determine whether it 
works. And even if there are not enough pa- 
tients to provide a clear answer on efficacy. 
Lane says scientists might still get valuable 
data by looking at parameters like the blood 
levels of Ebola virus in those treated with the 
drug and those in the control arm. 

The favipiravir study in Guinea illustrates 


the complexity of discerning clear answers 
without a robust control. A researcher who 
had seen the data and asked not to be iden- 
tified told Science that favipiravir did not 
help all of the 80 patients treated with it at 
two trial sites in Guinea. In a subset of trial 
participants who had low levels of Ebola vi- 
rus in the blood, however, the mortality was 
just 15%. In similar patients who entered the 
centers earlier and did not receive favipira- 
vir, mortality was 30%. Marie-Paule Kieny 
an assistant director-general at WHO, says it 
is difficult to make sense of the data at this 
point. “You can say it doesn’t mean anything 
or you can say it is promising. More research 
is needed to find out what really happened.” 

The INSERM researchers won’t make 
their data public until 25 February, at the 
Conference on Retroviruses and Opportu- 
nistic Infections in Seattle, Washington. “It is 
important to have a scientific debate about 
what these results really mean,” Levy says, 
noting that the meeting organizers insisted 
the data be embargoed. Meanwhile, the 
study in Guinea is continuing and has now 
enrolled more than 100 patients. “The final 
result may still be different,” Levy says. But 
the preliminary data have already led Guin- 
ean authorities to expand the numbers of 
sites where favipiravir is to be used. 

Other trials could prove harder to orga- 
nize and interpret if favipiravir is distributed 
widely. A study testing the use of convales- 
cent serum started in Guinea this week. “If 
there is a decision now to use favipiravir 
everywhere, what happens with that trial?” 
Kieny asks. The ZMapp trial may also be 
affected. That trial is designed to compare 
ZMapp with the standard of care. “If the 
standard of care changes, so does the control 
used in the trial,” Lane says. But he has not 
seen any results, he says. “The only data I 
have seen from that study are what was in 
The New York Timesl’ ■ 


Ebola treatment trials, coming and going 

One has stopped, others are starting. 


TREATMENT 

MECHANISM 

TRIAL LOCATION 

STATUS 

SPONSORS 

Brincidofovir 

Nucleotide 

analog 

Monrovia 

Stopped 

Chimerix, University 
of Oxford, Wellcome 
Trust, Liberian MoH 

Favipiravir 

Viral enzyme 
inhibitor 

Guinea: Gueckedou, 
Nzerekore, Macenta 

Ongoing, 
signal of 
efficacy? 

INSERM, Guinean 
MoH 

Convalescent 

serum 

Natural 

antibodies 

Gueckedou, Guinea 

Mid-February Belgian Institute of 
launch Tropical Medicine 

ZMapp 

Ebola antibody 
cocktail 

Monrovia 

Mid-February Liberian MoH, U.S. 
launch NIH, Mapp Bio 



UNITED NATIONS 

Sustainable 
goals from 
U.N. under fire 

Scientific review labels 
development targets as 
vague, weak, or meaningless 

By Erik Stokstad 

» sk diplomats from 70 countries to 
chart a course for sustainable devel- 
opment and what do you get? A long 
wish list of vague aspirations, cjmics 
say. The Sustainable Development 
Goals (SDGs), a major policy effort 
now in the works at the United Nations, 

“are fairy tales, dressed in the bureaucratese 
of intergovernmental narcissism,” fumed 
Richard Horton, editor of The Lancet, last 
year. Although that’s an extreme view, 
Horton and other critics have just gained 
some new ammunition. 

In the most comprehensive analysis to 
date, a panel of scientists has reviewed the 
169 draft targets included in the SDGs, which 
range from “equal access to justice” to ending 
poaching of wildlife. The report, conducted 
under the auspices of two international 
scientific organizations and released on 13 
February, offers some tough love. Although ^ 
supportive of the SDGs, it concludes that § 

o 

countries will struggle to achieve them unless S 
the targets are clarified and quantified. The I 
analysis is “one of the very few that’s taken g 
a look at the whole agenda, in a careful way | 
that’s science-based,” says Guido Schmidt- | 
Traub, executive director of the Sustainable I 
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Development Solutions Network, a nonprofit 
that advises the United Nations. “It is an im- 
pressive achievement,” adds Charles Kenny, a 
senior fellow at the Center for Global Devel- 
opment, a think tank in Washington, D.C. 

The SDGs are a follow-up to a well- 
regarded UN. initiative called the Millen- 
nium Campaign, which began in 
2002 and helped bring attention 
and funding to development work 
around the world. The campaign’s 
eight goals caught on because they 
were concise (“End extreme pov- 
erty”) and included just 18 targets, 
for which progress was often mea- 
surable (“Halve, between 1990 and 
2015, the proportion of people who 
suffer from hunger”), says Jan Van- 
demoortele, an independent scholar 
in Bruges, Belgium, who helped cre- 
ate the goals. 

The world has made good prog- 
ress toward those goals, according to 
a UN. interim report last year. The 
percentage of children under the age 
of 5 who were stunted from poor nu- 
trition, for example, dropped from 
40% in 1990 to 25% in 2012. Child 
mortality has declined by almost half 
over the same period. 

More needs to be done, and long 
before this year’s deadline for the 
Millennium Campaign’s goals, the 
United Nations was thinking about 
what would come next. In 2012, at 
the Rio +20 summit, nations agreed 
to create the SDGs to complete any 
unfinished work of the campaign, 
while adding an environmental fo- 
cus to address concerns that eco- 
nomic development may further 
jeopardize the planet. A working 
group representing 70 countries 
spent a year and a half coming up 
with broad goals and targets meant 
to judge progress toward those 


Scientists say U.N. targets should be specific, such as 
halving the number of people who lack enough water. 

goals. By the time the draft was released 
last July, the list had expanded to 17 goals 
with 169 underlying targets. 

One source of input to the working group 
was the International Council for Science 
(ICSU), a nonprofit based in Paris, which 
represents 121 national scientific organiza- 
tions. ICSU has now decided to offer feed- 
back as well. “We thought there was a need 
for a rigorous analysis from a science per- 
spective,” says Anne-Sophie Stevance, a staff 
member, who coordinated the report. Team- 
ing up with the International Social Science 
Council (ISSC), ICSU assembled more than 
40 scientists from various fields who spent 
several months looking over the targets. 

The overall findings: Only 29% of the 
169 targets are well-defined and scientifi- 
cally rigorous. The others lack specific end- 
points and time frames, and some can’t be 
measured accurately, if at all. Many of the 
targets for water and sanitation, for ex- 


Sustainable Development Goals off-target? 

A report says only 29% of SDG’s targets are well-defined (•), 
while 54% need work (#), and 17% are weak or nonessential (•), 
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ample, are ambiguous, not indicating what 
percentage of a population should experi- 
ence improvements by 2030. “Unconvincing 
targets could deter donor agencies,” the au- 
thors warn. They suggest, for example, that 
“substantially reduce the number of people 
suffering from water scarcity”— a current tar- 
get— be sharpened to halve the number. 

Also vague are the targets under Goal 7: 
“Ensure access to affordable, reliable, sustain- 
able, and modern energy for all.” The trouble 
is that access isn’t defined, nor is the level 
considered adequate. “Without being specific 
about this, the goal risks becoming weak and 
subject to loopholes,” the report says. 

Another symptom of fuzziness is that some 
targets are expressed as activities rather than 
endpoints. The goal of reducing inequality 
within and between countries, for example, 
lists open-ended targets such as “improve 
regulation and monitoring of global finan- 
cial markets.” The goal for peace, “Promote 
peaceful and inclusive societies for sustain- 
able development,” is called out as “particu- 
larly vapid.” There should be concrete targets 
for reducing assaults, murder, 
and war, the review asserts. 

All told, 54% of the targets 
need to be strengthened, and 
17% are weak or redundant and 
could be scrapped. The problem 
with vague goals, Vandemoor- 
tele says, is that they allow too 
much wiggle room in evaluat- 
ing progress. “It leaves us at a 
low level of political debate,” he 
says. “It will all be based on ide- 
ology and self-interest.” 

Yet the targets and goals, 
painstakingly negotiated among 
member states, may be hard to 
alter. “There’s a political con- 
cern, a fear that if you open 
any part of this, that the whole 
edifice might come crashing 
down,” Stevance says. Schmidt- 
Traub sees room for sharpening 
and condensing the targets over 
the coming weeks, but Vande- 
moortele is pessimistic about 
improvements beyond damage 
control. “We have passed the 
point of no return,” he says. 

ICSU and ISSC will present 
the report next week in New 
York, while U.N. negotiators 
meet to discuss the overarching 
vision of the SDGs. After a tech- 
nical review, the targets will be 
discussed by the full member- 
ship of the United Nations over 
the summer, with final goals 
and targets announced at a Sep- 
tember summit. ■ 
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PHARMACEUTICALS 

China’s lakes of pig manure 
spawn antibiotic resistance 

Researchers begin to size up a public health threat 
from burgeoning pork production 


By Christina Larson, in Tongxiang 
and Xiamen, China 

E verything about China’s pork industry 
is outsized. The country raises and 
consumes roughly half the planet’s 
pigs— about 500 million annually— as 
the middle class’s appetite for pork 
grows. The pigs produce an estimated 
618 billion kilograms of manure each year. 
And they consume and excrete tens of 
thousands of tons of antibiotics, added to 
their feed to keep them healthy and pro- 
mote growth. Recent studies, including one 
due out soon, are documenting the pre- 
dictable result: a proliferation in China’s 
watercourses of bacteria containing genes 
for resistance to antibiotics. 

It’s a phenomenon familiar to other 
countries with industrialized meat systems, 
but on a whole new scale. “In China, every 
year, we use 150,000 tons to 200,000 tons of 
antibiotics— about 10 times the U.S. usage,” 
says Ying Guangguo, an environmental 
chemist at the Chinese Academy of Sci- 
ences’ (CAS’s) State Key Laboratory of Or- 
ganic Geochemistry in Guangzhou. About 
half goes to human use and half goes into 
feed, largely for pigs. In recent years, the 
Chinese government has taken steps to 
curtail overuse of antibiotics by people 
{Science, 18 May 2012, p. 795). But antibiot- 
ics in animal feed are still poorly regulated. 

Zhu Yongguan, director of CAS’s Insti- 
tute of Urban Environment in Xiamen, 
hopes recent findings will prod the govern- 


ment into action. In a 2013 study published 
in the Proceedings of the National Academy 
of Sciences {PNAS), he and his colleagues 
found troublingly high levels of antibiotic 
resistance genes (ARGs) in manure from 
three large Chinese pig farms, with the 
most common ARGs present at levels hun- 
dreds or thousands of times higher than 
in control samples. The worry, Zhu says, 
is that the ARGs could “be transferred to 
pathogenic bacteria, and end up in clini- 
cal settings that have direct impact on 
humans— so-called superbugs.” 

Now, in what he says is China’s “first 
study looking at ARGs in river water using 
high-throughput molecular tools,” Zhu has 
shown that antibiotic-resistant bacteria are 
proliferating in the Jiulong River, which 
begins in the mountains of Fujian province 
in southern China and courses past rice 
paddies and swine farms. Zhu’s team has 
26 sampling sites along the river, allowing 
them to track where the ARGs enter it. In 
the study, which will soon appear in Ap- 
plied Microbiology and Biotechnology, they 
report that the abundance of ARGs in water 
flowing through the city of Longyan, down- 
stream of the farms, is “around a hundred 
times greater” than near the source. 

The Jiulong River is hardly an isolated 
example. In December, a report on state 
broadcaster CCTV cast a public spotlight on 
antibiotic contamination in several major 
Chinese rivers— including the Yangtze, Pearl, 
and Huangpu, which flows through Shang- 
hai. The report also alleged that Shandong 


Antibiotic-resistant bacteria teem in the Jiulong River 
as it flows through the southern city of Longyan. 

Lukang Pharmaceutical, a large drugmaker, 
was compounding the problem by illegally 
dumping antibiotics into rivers near its pro- 
duction facilities. (Company officials have ad- 
mitted wrongdoing and pledged to clean up.) 

Human overuse of antibiotics, which 
find their way into rivers through sewage 
treatment plants, contributes to the prob- 
lem. But Zhu’s lab can distinguish between 
the resistance genes for antibiotics admin- 
istered only to animals in China, such as 
tetracycline, and those prescribed only for 
humans. He found that farms were a major 
source of the ARGs in the river. 

Simple steps can reduce the presence of 
ARGS in runoff from swine farms, say re- 
searchers who have grappled with the prob- 
lem elsewhere. “The key practical process 
... is properly composting the manure,” says 
James Tiedje, director of the National Sci- 
ence Foundation Center for Microbial Ecol- 
ogy at Michigan State University in East 
Lansing and a co-author of the PNAS pa- 
per. “Compost that reaches sufficiently high 
temperatures will kill a large proportion of 
bacteria.” Zhu believes that Chinese farmers 
are beginning to adopt the practice. 

Even better is to reduce antibiotic use. On 
U.S. farms, improvements in hygiene and 
separating pigs of different ages have cut 
the spread of disease, even while farmers 
reduce the use of antibiotics in feed. Animal 
growth doesn’t suffer much. “We’ve started 
to reevaluate the use of antimicrobials and 
found that the responses are much lower in 
magnitude than earlier claims,” says Steve 
Dritz, a swine nutrition specialist at the Col- 
lege of Veterinary Medicine of Kansas State 
University in Manhattan. Dutch farmers 
recently cut their use of antibiotics in half 
without seriously harming productivity. 

Shen Jianping, an entrepreneur who in 
2006 founded Tongxiang-based Zhejiang 
Huateng Animal Husbandry Ltd. Co., has 
found the same thing. His firm now sells 
about 40,000 metric tons of antibiotic-free 
pig feed annually and also raises about 5000 
pigs on this diet. “The pigs grow more slowly,” 
Shen says, “but can reach the same size.” 

His approach is still very much the excep- 
tion, but Zhu found that it pays off. He took 
samples of pig manure from Shen’s farm, an- 
alyzed them for ARGs, and compared the re- 
sult with those from other Chinese farms. He 
has not yet published his findings but says 
the difference in diet “significantly reduces” | 
ARGs in excrement and thus the risk of con- ^ 
taminating the soil and nearby waterways. | 

As Ying, the researcher in Guangzhou, | 
puts it: “The simplest way to control ARGs g 
in the environment is to reduce inputs.” ■ I 
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CLIMATE SCIENCE 


Fund climate intervention research, study says 

US. needs more science to understand sun-blocking, carbon-removal technologies 


By Eli Kintisch 

T he U.S. government should fund re- 
search in the contentious field of 
climate engineering, the National 
Research Council (NRC) declared 
this week. One tome of the hefty, two- 
volume report examines so-called 
albedo modification— concepts for cool- 
ing the planet by shading it, for example 
by spreading particles in the stratosphere 
or by making clouds more reflective. The 
other scrutinizes carbon dioxide removal 
techniques, which directly reduce the 
amount of greenhouse gas in the atmo- 
sphere, by using giant carbon-sucking ma- 
chines, for instance. 

Neither “intervention” 
is “a replacement for 
reducing carbon emis- 
sions” in combating cli- 
mate change, the authors 
stressed in a statement. 

But they concluded that 
albedo modification— the 
more controversial of 
the options— requires a 
thorough look. And it’s 
“increasingly likely” that 
carbon removal and stor- 
age technologies will 
be needed “to avoid the 
worst impacts of climate 
change.” But any move to 
deploy them “should be 
informed by a far more 
substantive body of sci- 
entific research ... than 
is presently available.” To fill that gap, U.S. 
agencies need “coordinated” research plans, 
says the panel. 

Researchers in the field welcomed the call 
to action. But given the field’s turbulent his- 
tory, they wonder if NRC’s embrace will be 
enough to persuade skittish U.S. funding 
agencies to open their wallets. As early as 
2006, geochemist and Nobel laureate Paul 
Crutzen urged a concerted research effort, 
and a number of scientific societies and ad- 
visory panels have repeated the call. The 
I results have been meager: just a few mil- 
I lion dollars in federal support and about 
I $8.5 million since 2007 from a fund sup- 
§ ported by billionaire Bill Gates. 

^ The new report, from a 16-member NRC 
^ panel led by geophysicist Marcia McNutt, 
i the former head of the U.S. Geological 
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Survey and now editor-in-chief of Science, 
“might translate into government action,” 
says climate scientist David Keith of Har- 
vard University. “It might be the ‘permis- 
sion’ that [federal] program managers feel 
they need to move ahead.” 

Previous funding initiatives have fal- 
tered. In 2001, a draft Department of 
Energy (DOE) plan called for creating a 
$64 million research effort. But the agency 
squelched that proposal, former DOE of- 
ficial Aristides Patrinos said in 2009, be- 
cause it didn’t match agency priorities. 
DOE also feared “adverse publicity, regard- 
less of the merits of the research,” Patrinos 
noted. In Congress, says former Repre- 


sentative Bart Gordon (D-TN), who once 
led the House of Representatives science 
panel, the idea faced skeptics from both 
right and left— from conservative lawmak- 
ers who felt it addressed a nonexistent 
threat and from environmentalists worried 
that geoengineering would sap support for 
emissions cuts. 

The field continued to get the cold shoul- 
der under President Barack Obama. In 2009, 
after news reports suggested that White 
House science adviser John Holdren had put 
such studies in play, Holdren felt compelled 
to shoot down the idea. Among polic 5 miak- 
ers, the message was clear: The topic was 
taboo. It’s telling that the new report was 
originally requested by the CIA, although 
several research agencies, including DOE 
and NASA, ultimately helped pay for it. 

Corrected 12 February 2015; see full text. 
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The report offers few specific guide- 
lines for new funding. But it emphasizes 
that albedo modification potentially poses 
greater risks than carbon removal, because 
it involves large-scale changes to the at- 
mosphere. The panel concludes that ma- 
jor experiments should not advance until 
the government establishes a “deliberative 
process” for weighing environmental risks. 
Both research areas should be managed by 
the cross-agency U.S. Global Change Re- 
search Program, it suggests, embedding 
such studies within mainstream climate 
science. Yet the authors warn that mixing 
albedo modification studies with basic at- 
mospheric research could “have a chilling 
effect” on funding such 
basic studies. Some sci- 
entists think a dedicated 
research program would 
be less likely to “contam- 
inate” climate science. 

Researchers are eager 
to see how the govern- 
ment responds. Keith 
leads a team that wants 
funding to release half 
a kilogram of sulfuric 
acid particles into the 
stratosphere to see how 
sulfate haze— a possible 
sunshade— would affect 
ozone {Science, 18 Octo- 
ber 2013, p. 307). “The 
agencies say they’re wait- 
ing for guidance from 
OSTP [the White House 
Office of Science and 
Technology Policy], but OSTP says the agen- 
cies can do what they want,” he says. (OSTP 
declined to address Keith’s comment.) 

Some hope the report will end such grid- 
lock— and help the United States avoid the 
controversies that have crippled climate en- 
gineering research elsewhere. In the United 
Kingdom, the Royal Society in 2009 recom- 
mended a decadelong, £100 million research 
program. Six years later, it has funded just 
three projects, and the first— which involved 
using a balloon to release water vapor 1000 
meters up— died after controversy over 
patents and opposition from environmen- 
tal groups. The lesson, says physicist Tim 
Kruger of the University of Oxford in the 
United Kingdom, is that “for any proposed 
technique to be deployable, it needs both to 
work technically and socially.” ■ 
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A Canadian firm envisions carbon-sucking fans (artist’s concept) that could help cool the planet. 
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MODELS 


Models predict longer, deeper 
U.S. droughts 

Future western “megadroughts” could be worse than ever 


By Emily Underwood 

T he intricate sandstone ruins of Chaco 
Canyon, New Mexico— a once thriv- 
ing settlement abandoned in the 13th 
century by the ancient Pueblo peo- 
ples during a decades-long “mega- 
drought”— serve as a silent reminder 
to all who live in the arid regions: When 
water supplies dwindle, even sophisticated 
societies may not be able to adapt. 

Now, new research suggests that the se- 
vere, 60-year drought that likely helped 
empty Chaco Canyon was a preview of lon- 
ger, hotter dry spells to come as a result of 
climate change. The Chaco drought will 
look “quaint” compared with what com- 
puter models predict will hit the midwest- 
ern and southwestern United States over the 
next century, says Jason Smerdon, a climate 
scientist at Columbia University’s Lamont- 
Doherty Earth Observatory in Palisades, 
New York, and co-author of the new study, 
published this week in Science Advances. 

Warnings that droughts will intensify as 
the climate warms are not new. But this 
forecast gains credibility, other researchers 
say, because Smerdon and colleagues devel- 
oped detailed forecasts of soil moisture— a 
key drought measure— and put those pre- 
dictions into historical perspective. The 
study is “the most sophisticated effort 
I’ve seen by far to connect records of an- 
cient droughts with projections of future 
change,” says Jonathan Overpeck, a geolo- 
gist and atmospheric scientist at the Uni- 
versity of Arizona in Tucson. 

How global warming will affect specific 
regions is one of the thorniest questions in 
climate science. Previous studies have sug- 
gested that, in a warming world, existing 
weather patterns will intensify, causing wet 
regions of North America to get wetter and 
dry regions drier, with the Southwest expe- 
riencing the worst droughts first. To add 
detail to that picture, the researchers used 
17 state-of-the-art climate models to fore- 
cast three different measures of soil mois- 
ture over the next 100 years as the warming 
climate alters rainfall and speeds evapora- 
tion. The models also took into account 
variables such as wind speed and humidity. 
The results were striking. In nearly every 
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climate model, the projections pointed to 
severe drjdng in the Southwest and Mid- 
west by the end of the century if greenhouse 
gases continue to build up. “If models that 
are all constructed a bit differently all con- 
verge on the same answer, that gives us 
confidence that we are getting the right an- 
swer,” says co-author Benjamin Cook of the 
NASA Goddard Institute for Space Studies 
in New York City. 

To compare projected droughts with those 
of the past, the team turned to a 1000-year re- 
construction of past drought based on more 
than 1800 tree-ring chronologies collected 
across the continent. Tree-ring thickness 
decreases in dry years, allowing researchers 
to use changes in ring width to reconstruct 
drought histories using a metric called the 
Palmer Drought Severity Index (PDSI). A 
proxy for soil moisture, PDSI is used widely 
to assess drought because it can be calcu- 
lated from precipitation and temperature 
measurements alone. But its relative simplic- 
ity has also raised doubts: There’s been “a lot 
of argument over how well PDSI represents” 
the real world, says Gregory Pederson of the 
U.S. Geological Survey in Bozeman, Montana. 

To address that concern, the researchers 
took a careful look at the years between 1931 


and 1990, an interval when historic tree-ring 
data is most complete. They compared tree- 
ring reconstructions of PDSI during that 
period with the models’ predictions of soil 
moisture over the same period and found 
that the two data sets were statistically in- 
distinguishable. The results suggest that the 
PDSI is a more reliable measure of soil mois- 
ture than some have believed. Overpeck says. 

The findings also support previous studies 
suggesting that the 1000-year PDSI record 
accurately reflects past droughts, making 
it a valid benchmark for gauging the sever- 
ity of future ones. Cook says. The projected 
drying trends “are really quite robust and in 
many cases scary when you compare them to 
even the severe megadroughts of the 1100s 
and 1200s.” 

The models probably have weak spots, 
researchers say. Simulating summertime 
precipitation over the Southwest and Mid- 
west, for example, has proved challeng- 
ing. But the fact that the simulations all 
jdelded similar trends is worrjdng, they say. 
Overpeck notes an 80% likelihood that at 
least one decades-long megadrought will 
hit the regions between 2050 and 2100. 
“The only difference is that future mega- 
droughts will be hotter and more severe” 
than those seen in the past, he says. 

Such prolonged dry spells would put tre- 
mendous pressure on surface and ground- 
water resources. And Overpeck worries that 
even big rivers, such as the Colorado, would 
at times “have a very good chance” of running 
dry. The question, he says, is whether drought 
will again drive people from their homes, as 
it did 800 years ago at Chaco Canyon. “There 
may come a time when the droughts are so 
bad we can’t manage them.” ■ 


A drought without respite 

Soil moisture projections averaged across 17 climate models over the next century (green, red, 
and blue lines) compared with 1000-year drought history based on tree rings (brown). 
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WHO ARE THE 

TIBETANS? 

China and some scientists say they 
are Chinese. But others see a richer picture 

By Jane Qju 


E ven after 4000 years, the pose of the 
two skeletons, a woman and a child, 
is eloquent. Clutching the child to 
her breast, the woman looks to the 
sky as if seeking salvation. Nearby, 
a man lies on his stomach, his frac- 
tured legs folded backward. These 
and other poignant remains testify 
to the final moments 4000 years ago 
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of Lajia, a farming village at an altitude of 
about 1800 meters on the eastern edge of Ti- 
bet. Archaeologist Cai Linhai of the Qinghai 
Provincial Institute of Cultural Relics and Ar- 
chaeology in Xining, who is excavating there, 
calls it “the Pompeii of the Tibetan Plateau.” 

Lajia, on the upper reaches of the Yellow 
River, met its end when a mudflow engulfed 
it, perhaps triggered by an earthquake in the 


seismically active region. But profound puz- 
zles remain. Pottery and jade artifacts there 
appear to be derived from the Yangshao cul- 
ture, widely considered the precursor of mod- 
ern Chinese civilization. Were the victims of 
the apocalypse Han Chinese, or close kin? 
And what led their ancestors to settle on the 
harsh, low-oxygen “roof of the world”? 

Like many questions of geography and mi- 
gration, this one has political overtones. Seek- 
ing the high ground in disputes about Tibet’s 
historical relationship to China, the Chinese 
government has seized on recent findings 
that Tibetans and Han Chinese may have 
descended from a common ancestor, diverg- 
ing only in the past several thousand years. 
“The Chinese government is keen to prove 
that Tibetans and Han Chinese are tonggen 
tongyuan— of the same roots and the same 
source— in order to justify its rule in Tibet,” 
says a Chinese archaeologist who requests 
anonymity for fear of political repercussions. 

Some scholars say the evidence supports 
the government’s view. “Tibetans and Han 
Chinese are ultimately the same people, shar- 
ing a common ancestry that separated only 
very recently,” says Ren Xiaoyan, director of 
the Qjnghai institute. Others disagree. “Tibet- 
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ans are not the result of a single ethnic group 
who moved to the plateau and subsequently 
became Tibetans,” says archaeologist Lii 
Hongliang of Sichuan University in Chengdu. 

A spate of recent findings reveals that 
many different peoples sojourned on the 
great plateau. Archaeological evidence 
shows that nomads wandered up river val- 
leys almost as soon as modern humans ar- 
rived in Asia. They came not just from the 
east— the Chinese heartland— but from the 
west and south as well. The Tibetan Plateau 
“may have been a surprisingly cosmopolitan 
place in prehistory— a melting pot for people 
from different directions,” says archaeolo- 
gist Mark Aldenderfer of the University of 
California (UC), Merced. But which of those 
groups finally settled the plateau and be- 
came the present-day Tibetans? 

Some genetic and archaeological studies 
support the idea that the people who met 
their doom at Lajia were recent arrivals 
from China. But other evidence points to a 
more complex and nuanced picture, suggest- 
ing that the nomads who ventured onto the 
plateau thousands of years earlier, from all 
across Asia, left a substantial genetic imprint 
on today’s Tibetans. 

FOUR TIMES THE SIZE OF TEXAS, with 
an average elevation of 4000 meters, the 
Tibetan Plateau is one of the most forbid- 
ding stretches of our planet. Yet the modern 
humans spreading across Asia 40,000 years 
ago and more were apparently not daunted. 
River valleys running deep into the plateau’s 
heart “were probably quite a decent place to 
live,” even during the Last Glacial Maximum, 
a period of intense cold from about 28,000 to 
17,000 years B.P., says Xu Baiqing, a climate 
and environmental scientist at the Institute 
of Tibetan Plateau Research of the Chinese 
Academy of Sciences (CAS) in Beijing. Game, 
wild barley, and other resources there “could 
have sustained small populations for a very 
long time.” 

At Yushuiping, a site 2500 meters above 
sea level on the plateau’s southeastern edge 
in present-day Yunnan province, archaeolo- 
gist Dong Guanghui of Lanzhou University 
and colleagues recently excavated what may 
be the earliest evidence of humans in Tibet: 
stone tools and butchered animal bones 
dating to between 39,000 and 31,000 years 
B.P. The findings, which have not yet been 
submitted for publication, say nothing about 
where those early arrivals came from. But at 
sites on the opposite side of the plateau, in 
western Tibet, stone tools resemble those 
from Nepal dated to 25,000 to 20,000 years 
B.P. The implication, Aldenderfer says, is 
that “people could move onto the plateau 
from the southern Himalayas.” 

They may have arrived from the west, too. 
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Stone tools from sites in western Tibet- 
some dated to 20,000 B.P- are similar to 
ones found in the Altai Mountains, in south- 
ern Siberia. The Altai may have also been the 
source of a genetic adaptation that was key to 
later settlement at high altitudes. In a study 
in Nature last year, a team led by Wang Jun, 
director of BGI in Shenzhen, and population 
geneticist Rasmus Nielsen of UC Berkeley 
showed that EPASl—di gene regulating the 
production of oxygen-carr 5 dng hemoglobin 
in the blood— might have originated in the 
Denisovans, an archaic hominin species 
whose fossil remains are found only in the 
Altai Mountains. They suspect that Deniso- 
vans and modern humans interbred near 
Tibet 50,000 to 20,000 years ago. The very 
earliest migrants might have carried the gene 
to Tibet, or it might have reached the plateau 
later, after spreading across Asia. 

The early Tibetans would have been 
hunter-gatherers, chasing yaks and other 
game, as stone tools and animal fossils at 
the Yunnan site attest. Although some might 
have retreated to lower elevations in winter, 
other groups probably were year-round res- 
idents. “It simply would not be feasible for 
people to move up and down the plateau on 
a seasonal basis” if their settlements were 
in the heart of Tibet, Aldenderfer says. Pre- 
historic Tibetans “could seek refuge around 
the bountiful hot springs in western and 
central Tibet,” says archaeologist Hou 
Guangliang of Qinghai Normal University 


in Xining. They also warmed themselves 
at hearths; Hou and colleagues uncovered 
the oldest known fireplace on the plateau 
at Jiangxigou, a site on the southern edge 
of Qinghai Lake dating to 15,000 years B.P. 

By 9000 to 6000 years ago, people prob- 
ably lived year-round at Quesang, a newly 
excavated site near Lhasa, Tibet’s capital, at 
a dizzying 4200 meters above sea level. Que- 
sang’s inhabitants would have had to travel 
at least 700 kilometers to descend from the 
plateau: too great a seasonal migration even 
for far-ranging hunter-gatherers, Aldender- 
fer says. Such sites indicate that people 
might have settled on the plateau— and be- 
gan adapting genetically and culturally to the 
demands of life there— long before farming 
began, he says. “You don’t have to live every- 
where or stay at the same spot to have a year- 
round existence on the plateau.” 

Over time, a hunter-gatherer lifestyle gave 
way to pastoralism: raising sheep and domes- 
ticated yaks. In Quaternary Science Reviews 
last year, ecologist Georg Miehe of Philipps 
University of Marburg and colleagues sug- 
gested that the vegetation in Tibet began to 
change 8000 years ago as the herds multi- 
plied. They cited several lines of evidence, 
including pollen from lake sediments, to 
show that the dominant grass species in Ti- 
bet emerged about 8000 years ago. They also 
documented charcoal in the ancient soil lay- 
ers, along with a decline of forest pollen, sug- 
gesting that early colonizers of Tibet burned 


The high frontier 

Archaeological sites dot the Tibetan Plateau, showing that people arrived more than 30,000 
years ago and may have been year-round residents by 6000 years ago. More than 3800 Neo- 
lithic sites (not shown) record the spread of farming. 



.ALTAI 

MOUNTAINS 


>4000 m 
2500-4000 m 
1000-2500 m 
<1000 m 


Syr Darya 


• Neolithic sites 

• Paleolithic sites 
••• Migration route 

• •• Possible links 


Yellow 


China 


Qinghai Lake 


Qinghai ' 
provipce 


TIBETAN 

PLATEAU 


Yangtze 


Lhasa 


Quesang 


Indus 


Sichuan 

province 


Yushuiping 


Yunnan 

province 


Ganges 


Irrawaddy 


Mekong 


13 FEBRUARY 2015 • VOL 347 ISSUE 6223 7 0 9 


Published by AAAS 



NEWS I FEATURES 


forests to convert them to grasslands. 

But sustaining larger settlements meant 
growing food: a daunting challenge on the 
parched, cold plateau. Paradoxically, an epi- 
sode of global cooling may have triggered the 
key adaptations. 

Today the high-altitude valleys of Tibet are 
carpeted with barley. But the first farmers 
grew a different crop: millet. At Kamo, a vil- 
lage at 3100 meters that is the oldest broadly 
accepted permanent settlement in Tibet, Si- 
chuan University’s Lii and colleagues recov- 
ered millet dating as far back as 5000 B.P. 
Kamo, on the upper reaches of the Mekong 
River, was a good spot for farming, Lii says— 
“a warmer and wetter part of the plateau.” 

Settling the colder and drier northern 
plateau, however, apparently required dif- 
ferent crops. In a study published last 
month in Science (16 January, p. 248), 
Lanzhou University’s Dong, archaeologist 
Zhang Dongju, and climate scientist Chen 
Fahu, along with colleagues, radiocarbon 
dated charred grains from 53 sites on the 
northeastern Tibetan Plateau. The dates 
showed that up until about 3600 B.P., peo- 
ple lived below 2500 meters and their crops 
were almost exclusively millet. But later set- 
tlers ventured higher onto the plateau and 
grew mostly barley and some wheat. 

“This is counterintuitive,” Dong says. 
“Barley and wheat need more time to ma- 
ture than millet, and so would not seem to 
be a wise choice for the frigid high plateau 
where the growing season is short.” Yet those 
grains are more resistant to frost than millet. 
Dong and colleagues think a global cooling 
beginning about 4500 B.P. and lasting for a 
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millennium may have driven a shift to these 
new cereals, imports from the Near East. 
They in turn allowed farmers to colonize to 
higher elevations. “Wheat and barley not 
only helped them survive the big chill,” Chen 
says, “but expand their range to the heart of 
the plateau.” 

Conventional wisdom holds that people 
would have migrated onto the plateau en 
masse during a warming phase, when veg- 
etation would have been lusher. But the new 
proposal is gaining traction. “This strikes me 
as a very compelling scenario of why you’d 
get a rather late permanent colonization of 
the plateau,” says P. Jeffrey Brantingham, an 
archaeologist at UC Los Angeles who was not 
involved in the study. 

WHO WERE THESE PIONEER FARMERS? 

Historical records only begin with the reign 
of Songtsan Gampo, a warrior who united 
Tibet’s tribes in the 7th century C.E., while 
myths trace the origin of the Tibetans to 
the union of an ogress and monkey on the 
Gangpo Ri— a holy mountain 180 kilometers 
south of Lhasa. But lately, geneticists have 
started to pierce the haze. 

“A hallmark of Tibetans that distin- 
guishes them from other Asian populations 
is their ability to thrive in lofty mountains 
without getting altitude sickness,” says Su 
Bing, a population geneticist at CAS’s Kun- 
ming Institute of Zoology. Most Tibetans 
live above 3500 meters, where air contains 
40% less oxygen than that at sea level and 
rates of low birth weight babies and infant 
mortality are several times higher. “You 
really need to have many generations of 


people spending most of the year on the 
plateau to develop physiological adaptions 
and pass down adaptive genes,” Su says. 

To trace the history of those adaptations, a 
team led by BGI’s Wang sequenced the cod- 
ing region of 92% of the genes in 50 Tibetans 
and 40 Han Chinese. In at least 30 genes, 
they found, a variant common in Tibetans is 
rare in the Han. In the most extreme case, 
a variant of EPASl— the gene linked to high- 
altitude adaptation— was present in 87% of 
Tibetans but only 9% of the Han. A group 
headed by UC Berkeley’s Nielsen then tested 
various scenarios of population history— 
with different assumptions about popula- 
tion sizes, divergence time, and the amount 
of gene flow between peoples— to see which 
best explained Wang’s results. “We found 
that a divergence time of 2750 years ago 
could best reproduce the pattern of genetic 
variations,” says Nielsen, who with Wang and 
colleagues published their findings in Science 
in 2010 (2 July, p. 75). 

That remarkably recent date has met deep 
skepticism. “It contradicts too many things 
we know about Tibet,” Lii says, among them 
the evidence of farming settlements like Ka- 
mo, which dates from at least 2000 years 
earlier. Others take issue with the Science 
paper’s methodology. “The sample size and 
population coverage are too small to tease 
out the complexity of the population history 
of Tibet,” says Jin Li, a population geneticist 
at Fudan University in Shanghai. Nielsen’s 
best fit scenario, which assumes a population 
of 288 Han Chinese and 22,642 Tibetans at 
the time of the split, also strikes many as im- 
plausible. “This simply can’t be tme,” Su says. 

“You can tweak the model to give 
you the pattern of genetic varia- 
tions you want, but this doesn’t 
necessarily mean that the sce- 
nario is realistic.” Nielsen says 
his team has revised its models 
and now thinks that Tibetans 
and Han Chinese probably sepa- 
rated 5000 to 3000 years ago. 

Linguistics also suggests that 
the Tibetan and Chinese people 
share a common, but more dis- 
tant, root. By comparing features 
such as sounds, dialects, and 
word order, William Shi-Yuan 
Wang, a linguist at City Univer- 
sity of Hong Kong, constmcted 
a family tree in 1998 that places 
the Tibetan-Chinese split at 
about 6000 years ago. George 
van Driem, an expert of Himala- ^ 
yan languages at the University | 
of Bern, argues for a similar di- | 
vergence time. After anal 5 ^ing x 
how the languages evolved over g 
time— as new words emerged I 
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Excavations at the 151 locality near Qinghai Lake have revealed elaborate wooden huts where early Tibetans 
may have sheltered themselves from the cold climate 9000 years ago. 
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Global cooling helped expand the range of cold-hardy barley, here being winnowed near Gyangtze in southern Tibet, to the high plateau 3600 years ago. 


and old ones dropped out— he suggests that 
the first Tibetan speakers emerged in west- 
ern Sichuan province, on the eastern edge of 
the Tibetan Plateau, about 7000 years ago. 

But Su postulates a more complex and 
much earlier origin for Tibetans. In a series 
of papers m Molecular Biology and Evolution 
between 2011 and 2013, his team reported 
sequencing DNA from 6109 Tibetans from 
41 villages across the Tibetan Plateau. They 
found Tibetan-specific sequences on the Y 
chromosome and in mitochondrial DNA 
that are absent or occur at very low frequen- 
cies in other Asian populations. Based on 
known mutation rates, Su suggests that the 
sequences began evolving 30,000 to 20,000 
years ago, among early nomads who had 
I settled on the plateau. 

I Sequence diversity can also track past 
I population bottlenecks and expansions. In 

0 their DNA data, Su’s team found signs that 

1 Tibet’s population grew fourfold between 

§ 10,000 and 7000 years ago. Those arguing 

g for a close kinship between the Tibetans 
I and the Han may be at least partly right, 
I Su says. “The data point to a second wave 
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of migration, probably of Han Chinese, 
into Tibet,” he says. “The newcomers most 
likely bred with earlier settlers, giving rise 
to present-day Tibetans.” 

Existing genetic data won’t be enough to 
sort out the puzzles. Researchers agree that 
they need more extensive archaeological data 
from the heart of the plateau, larger genetic 
studies, and clues from ancient DNA. “We are 
still looking for a good specific model of the 
history of Tibet,” Nielsen says. 

REGARDLESS OF THEIR ORIGINS, Tibet- 
ans never were isolated in their mountain 
fastness. They remained part of Eurasia’s hu- 
man melting pot, with new arrivals adding 
to the mix. “People normally see mountains 
as barriers,” says Xinyi Liu, an archaeologist 
at Washington University in St. Louis. But in 
prehistory as today, he says, they entice peo- 
ple from the lowlands with their rich natural 
resources, such as plants and game, tool- 
making materials, firewood, and, most im- 
portant, water. 

Over the past decade, archaeologists have 
documented numerous prehistoric settle- 


ments along what archaeologist Michael 
Frachetti, also of Washington University, calls 
the Inner Asian Mountain Corridors, from 
Central Asia and the Himalayas to Tibet and 
the Altai Mountains. Around 5000 B.P., “you 
begin to see unquestionable connections” 
with clear signs of trade, he says. 

These mountain corridors may well have 
been the precursors of the Silk Road, the 
overland trade routes linking China to South 
Asia, the Near East, Central Asia, and Europe 
starting around the first century B.C.E. The 
mountains were conduits for the exchange of 
genes, goods, and ideas, says Martin Jones, 
an archaeologist at the University of Cam- 
bridge in the United Kingdom. 

Tibet was the heart of that cultural nexus. 
“It’s been a magnet for peoples from differ- 
ent directions for tens of thousands of years,” 
Aldenderfer says. If that’s the case, today’s 
Tibetans emerged from a prehistoric cosmo- 
politanism that, no matter which way the po- 
litical winds blow, cannot be attributed to a 
single culture or ethnicity. ■ 


Jane Qiu is a science writer in Beijing. 
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BIOMINERALIZATION 

Built for tough conditions 

High-resolution structural data help to explain the mechanical properties of rodent teeth 


By Yael Politi^ and James C. Weaver^^ 

1 0 perform such mechanically de- 
manding tasks as biting, chewing, 
or even rock-grinding, animal teeth 
have evolved into a wide range of 
morphologies, constructed from 
minerals as diverse as magnetite, cal- 
cium carbonate, and carbonated hydroxy- 
apatite. Moreover, serial tooth replacement 
or continuously growing teeth with self- 
sharpening abilities have evolved in many 
vertebrates and invertebrates to counteract 
the effects of constant abrasion (see the fig- 
ure). On page 746 of this issue, Gordon et al. 
(1) use rodent and rabbit tooth enamel as 
model systems, combining atomic resolu- 
tion tomography and x-ray spectroscopy to 
elucidate the nanoscale architecture of this 
high-performance biological ceramic. 

Teeth are routinely exposed to extremely 
high loads. Complex three-dimensional 
microstructures with finely tuned com- 
positional gradients enable them to cope 
with these forces; interactions between 
the organic and inorganic building blocks 
largely govern the material’s mechani- 
cal performance. Gordon et al. now map, 
at subnanometer resolution, the three- 
dimensional distribution of a highly doped 
interstitial amorphous mineral phase 
that surrounds the single-crystal fibers in 
dental enamel. 

Tooth structural diversity in vertebrates and 
invertebrates. (A) Gecko, (B) weasel, (C) piranha, (D) 
squirrel, (E) chiton, and (F) sea urchin. Images (A) to (E) 
are scanning electron micrographs: image (F) is a three- 
dimensional reconstruction from microCT data. Inspired 
by the remarkable structural complexity of teeth such as 
those shown here, Gordon etal. elucidate subnanoscale 
variations in structure and composition of dental enamel 
in the continuously growing teeth of rabbits and beavers. 
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The synthesis of many biominerals, in- 
cluding dental enamel (2), involves crystal- 
lization from a disordered precursor phase 
(2, 3). The amorphous layer in dental enamel 
described by Gordon et al. likely results 
from this process {2). Amorphous precursor 
phases were first found in the magnetite- 
containing teeth of chitons (see the figure, 
panel E) {3) and are known today in many 
other groups, including mollusks, echino- 
derms, vertebrates {2), and bacteria (5). 

An amorphous precursor is advantageous 
for many reasons. It is a convenient way of 
transporting ions to the site of mineraliza- 
tion and permits molding of the mineral 
into complex shapes (4). These transforma- 
tion mechanisms, though poorly understood, 
have a bearing on the nanoscale architecture 
and mechanical properties of the crystalline 
product (5). 


^...beaver and rabbU incisors 
[are] excellent model 
systems for[..] investigating 
the complex structural 
hierarchies that have evolved 
to counteract repetitive high- 
energy loading events.” 

Amorphous phases can accommodate 
more impurities than a crystal with well- 
defined lattice and sjmimetry restrictions. 
As a result, impurities from the amorphous 
phase may be trapped in the crystal lattice or 
excluded during crystallization. For example, 
biomacromolecules are often bound to spe- 
cific atomic planes in the crystalline product, 
where they affect its texture and mechanical 
properties {4, 6, 7). It is unclear, however, 
when these molecules are introduced into 
the mineral and if or how they rearrange 
during the phase transformation. 

Inorganic ions that can be accommo- 
dated by the emerging lattice during this 
phase transformation can persist in the final 
crystal and have a direct effect on the min- 
eral’s mechanical performance. In fact, us- 
ing amorphous precursors, sea urchins can 
produce calcite with up to 45 moI% of Mg 
ions in the grinding tips of their continu- 
ously growing, self-sharpening teeth (8) (see 
the figure, panel F). If the impurities cannot 
be accommodated in the crystal lattice, they 
will be excluded by the crystallization front 


^Max-Planck-lnstitute of Colloids and Interfaces, 14424 
Potsdam, Germany. ^H/yss Institute for Biologically Inspired 
Engineering, Harvard University, Cambridge, MA 02138, USA. 
E-mail: yael.politi@mpikg. mpg.de 
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and accumulate at grain boundaries, as seen 
by Gordon et al. 

Most amorphous precursors are hydrated 
phases, whereas the crystalline forms are 
often anhydrous or contain less water than 
their disordered counterparts. During for- 
mation of the sea urchin larval skeleton, for 
example, this dehydration process precedes 
crystallization (P). Kababya et al. have sug- 
gested that water exclusion plays a key role 
in the atomic rearrangement of the amor- 
phous phase {10). 

The incorporation of some additives and 
the exclusion of others during the crystalliza- 
tion of amorphous precursors appear to be 
important features of various biomineraliza- 
tion processes. The excluded additives can 
concentrate at the crystal surface, resulting 
in a thin amorphous layer with altered com- 
position. Such layers have been observed in 
mollusk nacre {11), sea urchins spines {12), 
and now vertebrate dental enamel {!). Gor- 
don et al. show that this boundary layer 
may also critically affect the properties of 
the composite. In rabbit enamel, the Mg- 
rich amorphous calcium phosphate layer 
surrounding the carbonated hydroxyapatite 
fibers renders the enamel more soluble. In 
contrast, the iron-rich amorphous phase 
in pigmented beaver enamel is less soluble 
and exhibits increased hardness, as verified 
through nanoindentation measurements {!). 

Both beaver and rabbit incisors grow con- 
tinuously throughout the life of the animal. 
They are thus excellent model systems for 
investigating the real-time processes of tooth 
development and maturation, the mechani- 
cal consequences of wear and fatigue, and— 
as reported by Gordon et aZ.— the complex 
structural hierarchies that have evolved to 
counteract repetitive high energy loading 
events. The ultra-high resolution character- 
ization techniques employed in the present 
study thus lay the groundwork for the explo- 
ration of similar structure-function relation- 
ships in human tooth enamel with potential 
implications for the prevention and treat- 
ment of dental caries. ■ 
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DRUG DISCOVERY 

Tying up a 

transcription 

factor 

A bivalent small molecule 
inhibits the activity of a 
rogue transcription factor 
in leukemia 

By Andrew Chen^ and Angela N. Koehler^’^ 

T ranscription factors form multipro- 
tein complexes that coordinate gene 
expression and regulate cellular re- 
sponses {!). In cancer, aberrantly ex- 
pressed transcription factors often 
alter gene expression programs, and 
although these transcription factors are 
promising targets for therapeutics {2), they 
remain largely untested. Deemed “undrug- 
gable” {3), their perturbation often requires 
specific disruption of protein-protein or 
protein-DNA interactions. To date, the dis- 
covery or design of small-molecule probes 
and drugs that specifically disrupt such in- 
teractions has proved challenging. On page 
779 of this issue, Illendula et al. {4) describe 
the development of a small molecule that in- 
hibits progression of a subtype of acute my- 
eloid leukemia (AML). The compound binds 
to an oncogenic version of a transcription 
factor subunit called core binding factor p 
(CBFp), which arises from a chromosomal 
translocation. The strategy used to develop 
this drug may be applicable to other onco- 
genic proteins that arise from chromosomal 
translocation events. 

AML is the most common form of adult 
leukemia, accounting for -25% of such cases 
(5). The frontline treatment involves nonse- 
lective chemotherapy, leading to successful 
remission induction for most patients with 
newly diagnosed inv(16) AML. Despite this 
success, new therapeutic approaches are 
sorely needed, as there are few treatment 
options for patients with refractory disease 
or for older patients for whom intensive che- 
motherapy is less favorable {6, 7). In normal 
cells, CBFp binds to the transcription factor 
RUNXl to promote DNA binding at RUNXl 
target gene promoter sites, inducing their 
expression (8). AML cells with the chromo- 
some inversion inv(16) produce a fusion pro- 
tein consisting of CBFp and smooth-muscle 
myosin heavy chain (SMMHC). The fusion 
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protein has a higher affinity for RUNXl than 
does CBFp {9, 10). As a result, CBFp-SMMHC 
sequesters RUNXl from its target genes, de- 
regulates RUNXl-mediated transcription, 
and induces AML. 

Without definitive rules to guide rational 
design, large libraries of compounds are typi- 
cally screened to identify candidate probes 
that modulate a given protein-protein or 
protein-nucleic acid interaction or that al- 
ter transcriptional readouts in cells (5, 11, 
12). Despite a handful of successes— includ- 
ing antagonists of MDM2-p53 interaction 
and DNA-HIF-1 interaction (both complexes 
promote tumorigenesis) {13)— sl systematic 
strategy for the intracellular modulation 
of transcription factor function in vitro or 
in vivo using small molecules has yet to be 
developed. Illendula et at. took advantage of 
the interaction between the oncogenic fusion 
protein CBFp-SMMHC and RUNXl to de- 
velop a small molecule capable of abrogating 
this association. The compound effectively 
sequesters the oncogenic fusion protein, 
thereby allowing RUNXl to bind to normal 
CBFp present in the cell and restore expres- 
sion of RUNXl target genes (see the figure). 


^Department of Biological Engineering, Koch Institute for 
Integrative Cancer Research, Massachusetts Institute of 
Technology, Cambridge, MA, USA. ^Broad Institute of MIT and 
Harvard, Cambridge, MA, USA. E-mail: koehler@mit.edu 


Reduced RUNXl target 
gene expression 


Illendula et al. used a high-throughput 
fluorescence resonance energy transfer 
(FRET) assay (14) to identify a lead com- 
pound that blocked the interaction of the 
oncogenic fusion protein with the RUNXl 
Runt domain. Because CBFp-SMMHC is 
oligomeric in solution (whereas CBFp is 
monomeric), the authors designed bivalent 
analogs of their lead compound with vari- 
ous polyethylene glycol linkers. Using the 
FRET-based assay and a cell viability assay, 
they found that a linker of seven atoms was 
optimal for potency and selectivity— that 
is, this iteration of the lead compound se- 
lectively inhibited the proliferation of a 
human AML cell line (ME-1) with the tar- 
geted chromosome inversion. The bivalent 
compound was further optimized through 
the introduction of trifluoromethoxy sub- 
stitutions to increase its half-life in mouse 
plasma, resulting in the drug AI-10-49. This 
stable compound showed potent impair- 
ment of ME-1 cell viability as well as a high 
affinity for the oncogenic fusion protein. 
AI-10-49 specifically affected ME-1 out of a 
panel of normal bone marrow cells and 11 
human leukemia cell lines (CBFp-SMMHC 
was not present in the unaffected cells), and 
selectively bound the oncogenic fusion pro- 
tein over CBFp in experiments focused on 
monitoring dissociation of CBFp-SMMHC 
and CBFp from RUNXl in response to drug 


Potential treatment for leukemia. The polyvalent 
compound AI-10-49 allows the transcription factor 
RUNXl to dissociate from the oncogenic fusion protein 
CBFp-SMMHC, the underlying cause of inv(16) AML. 

This allows normal interaction of RUNXl with its target 
genes. Treatment with AI-10-49 prolonged the survival 
of mice transplanted with leukemic cells expressing 
CBFp-SMMHC. 

treatment. Additionally, AI-10-49 treatment 
rescued target gene occupancy by RUNXl 
and expression of RUNXl target genes spe- 
cifically in ME-1 cells. 

In more clinically relevant experiments, 
AI-10-49 prolonged the survival of mice 
transplanted with leukemic cells without 
any observed toxic effects. The compound 
also reduced viability and colony formation 
(a readout for hematopoietic progenitor cell 
proliferation) in human primary inv(16) 
AML myeloid blasts (immature white blood 
cells), but not in AML cell samples without 
the chromosome inversion. Comparison of 
AI-10-49 to the monovalent version further 
validated the strategy of using a bivalent 
compound to selectively block the oncogenic 
fusion protein. The specificity of AI-10-49 for 
the oncogenic fusion protein CBFp-SMMHC 
should enable targeted treatment of inv(16) 
AML that is unmatched by the few existing 
small molecules that directly bind to other 
transcription factors (3). 

The polyvalent strategy may serve as a 
template for new drug discovery efforts fo- 
cused on selective modulation of aberrant 
fusion proteins arising from chromosomal 
translocation events, particularly those that 
form homodimers. Although many tran- 
scription factors that are deranged in cancer 
may not benefit from this polyvalent strat- 
egy (because they do not form oligomers like 
CBFp-SMMHC), the example of Illendula et 
al. serves as a proof of concept for targeted 
therapies aimed at dysregulated transcrip- 
tion and should inspire the development of 
additional directed approaches to control 
aberrant transcription factor function in 
cancer and other diseases. ■ 
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Kickstarting a viral RNA polymerase 

Hepatitis C virus RNA- dependent RNA polymerase primes synthesis of its 
RNA genome with an inbuilt proteinaceous primer 


By Stephane Bressanelli 

R NA viruses are a ubiquitous class of 
pathogens that cause serious human, 
animal, and plant disease. To synthe- 
size new viral genomes in infected 
hosts, RNA viruses encode special 
enzymes, RNA-dependent RNA poly- 
merases (RdRps), which are thus prime tar- 
gets for antiviral drug design. A case study 
is hepatitis C virus (HCV). HCV is a major 
human pathogen that chronically infects 
an estimated 170 million people worldwide, 
greatly increasing risk of developing life- 
threatening liver disease, including cirrhosis 
and cancer. New drugs against HCV have 
recently come to market that are extremely 
successful compared to the former standard 
of care and allow cure (that is, eradication of 
HCV from the patient) in over 95% of cases 
(i). Foremost among these new drugs is so- 
fosbuvir, a nucleoside inhibitor targeting 
the active site of the HCV RdRp, NS5B. On 
page 771 of this issue, Appleby et at. describe 


new crystal structures of HCV NS5B {2) that 
are a major advance both in our basic un- 
derstanding of RdRp activity and in the way 
sofosbuvir can inhibit HCV replication. 

When they S 5 mthesize a new nucleic acid, 
most RNA (and DNA) poljmierases do not 
truly “start” S5mthesis of the strand comple- 
mentary to a template strand. Instead, they 
only elongate an existing RNA (or DNA) 
primer molecule. There is an exception, 
though: A ubiquitous class of viral RdRps 
can initiate RNA synthesis de novo, in the 
absence of an RNA primer. The conundrum 
presented by these enz 3 mies is that during 
initiation, they must provide themselves 
with a proteinaceous “priming platform” to 
buttress the priming ribonucleotide that in 
effect acts as a one-nucleotide primer. This 
platform must then be removed to allow 
for subsequent elongation of the de novo 
synthesized short RNA, resulting in a situa- 
tion equivalent to primer-driven nucleic acid 
S 5 mthesis found in other poljmierases (3). A 
mechanism by which this problem has been 


solved was revealed by the x-ray crystal 
structure of the double-stranded RNA reovi- 
rus RdRp lambdas. In this case, the priming 
platform is a small, easily retracted protein 
loop, which allowed several rounds of RNA 
synthesis to be directly visualized {4). 

In the case of HCV NS5B and other re- 
lated RdRps, and unlike the case of reovirus 
lambdas, the exit path of the newly synthe- 
sized RNA is blocked by specific structural 
elements (in green and yellow in panel A 
of the figure, left) (5). Attempts to structur- 
ally characterize the mechanism by which 
the priming platform switches from its 
priming position to allow the transition to 
elongation of the RNA chain have long been 
unsuccessful. The main reason for this may 
seem paradoxical: In isolation, HCV NS5B 
is a self-inhibited RdRp with a low rate of 
initiation and a very poor transition to RNA 
chain elongation, a property clearly related 
to the block in the RNA exit path (d). This 
apparent “bug” in the system is actually a 
useful feature: In HCV-infected cells, a huge 
excess of NS5B is produced, almost all of 
which must be kept inactive. 

How then did Appleby et al. catch HCV 
NS5B in the process of transitioning from 
initiation to elongation (panel A of the fig- 
ure, middle)? The HCV community has 
produced an enormous amount of basic 
knowledge over 25 years of collaborative 
research from both academia and indus- 
try since the discovery of the main caus- 
ative agent of “non-A, non-B hepatitis” (7). 
Appleby et al. cogently used this informa- 
tion and added several clever tricks of their 
own. Thus, they worked with an abnormal 
HCV strain, Japanese fulminant hepatitis 
1 (JFHl), that caused a fulminant hepatitis 
instead of the usual slow course of HCV-re- 
lated disease. This strain has extraordinary 
replicative capabilities (8) due to a much 
higher de novo RNA sjmthesis efficiency of 
its RdRp (9). In addition, they crafted an 
original combination of quasi-substrate and 
catalytic metal ion to lock JFHl HCV NS5B 
in two successive steps of transition from 
initiation (panel A of the figure, middle and 
right). As usual with crystal structures, the 
new data provide a wealth of atomic-level 
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Priming the genome synthesis of an RNA virus. (A) Successive steps in RNA-dependent RNA polymerization by 
HCV NS5B. (Left) NS5B in its preinitiation conformation, with the RNA exit path blocked by the priming loop (in green) 
and carboxyl terminus (in yellow). (Middle and right) The new structures show how the primer loop recedes as NS5B 
opens, guiding the RNA along the opened path as successive nucleotides (in magenta) become incorporated in the 
growing RNA primer (in white) complementary to the template (in black). Among major pathogens, the flavivirus 
polymerase NS5 RdRp (from West Nile virus) (B) and the influenza virus RdRp catalytic subunit PBl (C) also harbor an 
internal priming loop. For clarity, the PBl carboxyl terminus is not displayed. 
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INSIGHTS I PERSPECTIVES 


STRUCTURAL BIOLOGY 

Breaking the intestinal 
barrier to deliver drugs 

The structure of a tight junction protein bound to a 
disruptive toxin may guide drug delivery strategies 


details that in one go both confirm, extend, 
and revise previous knowledge about HCV 
NS5B and related RdRps. 

The new structures definitively establish 
the HCV NS5B priming platform as the “p 
loop” (shown in green on panel A of the fig- 
ure), an insertion in the “thumb” domain of 
viral RdRp. Such a priming loop, which had 
only been previously seen in HCV NS5B and 
its close RdRp relatives in the Flaviviridae 
family (5), has also recently been discov- 
ered in the influenza virus RdRp PBl (10), 
extending the range of major pathogens 
to which Appleby et aUs work is relevant. 
More importantly, Appleby et al. reveal the 
molecular mechanisms by which the p loop 
progressively recedes from the catalytic cleft 
of HCV NS5B, buttressing and guiding the 
nascent template-primer duplex along the 
RNA exit path right after initiation (panel A 
of the figure, middle). 

Another major breakthrough is that, un- 
like in previous work of the same authors 
(11), the new structures are of so-called ter- 
nary complexes; that is, complexes with not 
only RNA but also the incoming, yet to-be- 
incorporated nucleotide at the catalytic site 
(in magenta on panel A of the figure, middle 
and right). The atomic details of incoming 
nucleotide recognition show unexpected 
features that explain how HCV NS5B may 
incorporate, and thus HCV replication be 
inhibited by, modified nucleosides such as 
sofosbuvir. As a final touch, the authors pro- 
vide the structure of an actual ternary com- 
plex with incoming sofosbuvir. 

Appleby et alls study is a technical tour de 
force providing much needed basic insights 
into viral RdRp structure and function, as 
well as the ways by which RdRp may be ef- 
ficiently inhibited. The work was done with 
HCV NS5B (panel A of the figure), which was 
the first viral RdRp of known structure (5) 
and for which effective antiviral drugs now 
exist (1). But it also has far-reaching conse- 
quences for other major RNA viral pathogens 
for which new drugs are needed. These in- 
clude West Nile virus (panel B of the figure) 
and Dengue virus in the same Flaviviridae 
family as HCV (5) but also more distantly re- 
lated RNA viruses such as influenza viruses 
(panel C of the figure) (10). ■ 
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By Per Artursson^ and Stefan D. Kni^t^ 

U any drugs must be absorbed into 
the circulation for medicinal effects 
to occur at the intended sites of ac- 
tion, and so a holy grail of drug de- 
livery is to improve the passage of 
pharmaceuticals across tissue bar- 
riers. Most oral drugs are absorbed in the 
small intestine, where the lumen is lined with 
epithelial cells. Thus, new therapeutic strat- 
egies for efficient oral delivery can benefit 
from a better understanding of the protein 
complexes, such as the tight junction, that 
maintain the integrity of this epithelium. On 
page 775 of this issue, Saitoh et al. (1) report 
the structure of a tight junction constituent 
called claudin-19, bound to a bacterial toxin 
called Clostridium perfringens enterotoxin 
(CPE), an agent that disrupts tight junctions 
and is a major cause of foodborne illness by 
this pathogen. The structural information 
may be useful in developing specific clau- 
din-targeted compounds that improve drug 
delivery across tissue barriers that currently 
limit drug absorption. 

Tight junctions connect adjacent endo- 
thelial cells and control diffusion of solutes 
between them. In the skin, lungs, eyes, and 
gut, these junctions form epithelial barriers 
against harmful agents in the environment, 
and in brain capillaries they form the blood- 
brain barrier that protects the brain against 
leakage of toxic agents from the blood. Skin 
and blood-brain barrier tight junctions are 
largely impermeable, reflecting their es- 
sential barrier function in these tissues, 
whereas in the intestine and other epithelia 
that transport molecules, they display selec- 
tive permeability for ions such as sodium, 
calcium, and magnesium (2). There is ample 
evidence that claudins— a 27-member family 
of transmembrane proteins— are respon- 
sible for the “paracellular barrier” functions 
in epithelia (3). 

The claudins, which are expressed in 
tissue-specific combinations, form the extra- 
cellular compartment of the junctional com- 
plex together with other transmembrane 
barrier proteins such as occludin. Within an 
epithelial cell, claudins are linked to periph- 
eral scaffolding proteins (e.g., ZO-1), that in 


turn are linked to the cytoskeleton (actin 
and microtubules) through linker proteins 
(see the figure). Several of these intercel- 
lular proteins have phosphorylation sites 
that control assembly of the tight junction 
complex and hence the paracellular barrier. 
Tight junctions are dynamic structures in 
that they are sensitive to changes in the lo- 
cal environment— for example, those caused 
by proinfiammatory cytokines and bacterial 
pathogens (2). Knowledge of how these junc- 
tions are regulated and affected by toxins 
such as CPE is essential for understanding 
tissue homeostasis and for tight junction- 
targeted drug development. 


“...structural information 
may be useful in developir^ 
specific claudin-targeted 
compounds that improve 
drug delivery...” 

The paracellular barrier function of tight 
junctions is attributed to several claudins, 
including the CPE-binding claudin-3, -4, 
and -19. However, a paracellular channel 
function has been suggested for others, 
including claudin-2 and -15. Disordering 
of the claudin structure may open either 
or both of two distinct channel popula- 
tions of the tight junction: the pore path- 
way that routes small ions, and the leak 
pathway that allows the passage of drugs 
and macromolecules. In mice, ion pores 
in intestinal tight junctions provide the 
passage of sodium ions into the intesti- 
nal lumen, which is necessary for nutrient 
absorption (4). For drug delivery, claudin- 
targeted agents must open the leak path- 
way in a controlled, reversible manner that 
does not disturb ion transport and tissue 
homeostasis. 
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The crystal structure of 
claudin-15 (which does not 
bind the toxin CPE) revealed 
a five-stranded extracellular 
antiparallel p-sheet domain 
anchored to a transmembrane 
region consisting of a four- 
helix bundle (5). On the basis 
of this structure and muta- 
genesis data, a model for how 
claudins form tight junction 
strands in the membrane by as- 
sembling into antiparallel dou- 
ble rows has been proposed (5, 
6). In this model, in-row inter- 
actions are maintained by con- 
served hydrophobic contacts 
between neighboring claudin 
molecules, while interactions 
across rows are mediated by 
p-strand pairing between ad- 
jacent edge strands of the ex- 
tracellular p-sheet domains. 
Association between tight 
junction strands in adjacent 
cells via loops in the extracel- 
lular domain would result in 
the formation of ion-selective 
paracellular pores, ~10 A in 
diameter, within pore-forming 
claudins (6). 

The binding of CPE to the 
extracellular domain of target 
claudins through the toxin’s 
C-terminal domain (C-CPE) 
leads to disassembly of tight 
junctions and aggregation of 
claudin-C-CPE complexes at 
the intestinal cell surface, as 
well as formation of a large 
transmembrane pore through 
oligomerization of CPE N- 
terminal domains. Calcium 
ion influx through this pore 
into epithelial cells leads to 
cell death. Saitoh et at. provide 
a first glimpse of how C-CPE 
causes tight junction disas- 
sembly in a crystal structure 
of claudin-19 bound to C-CPE. 
The observed interface be- 
tween claudin-19 and C-CPE 
is predominantly hydropho- 
bic, although there are also 
hydrogen-bond and ionic in- 
y teractions that can provide 
I specificity and thus may be 
i useful in directing attempts to 
I identify or design molecules 
I that mimic the interaction be- 
J tween CPE and claudins. 

I Does the structure reported 
^ by Saitoh et at. explain how the 
d binding of C-CPE to claudin-19 


Intestinal epithelial cells 



A 




c ^ 

k 






1 C-CPE 

> 

r 


Drugs and macromolecules 


Exploiting disruption. (A) Tight junction assembly and dynamics are 
controlled by intercellular proteins associated with claudin. (B) Claudin-2 
and -15 provide a pore pathway for maintaining sodium gradients that 
are essential for nutrient absorption from the intestinal lumen. (C) The 
molecule C-CPE binds to claudin, allowing passage of large molecules such 
as drugs, peptides, and proteins across the intestinal epithelium. 


disrupts tight junctions? Unfortunately, no 
structure of claudin-19 without C-CPE bound 
is available, so direct observation of the effect 
of C-CPE binding is not possible. However, 
from comparison with a claudin-19 homol- 
ogy model that is based on the structure of 
claudin-15 (38% identical to claudin-19 by 
amino acid sequence), Saitoh et al. suggest 
that binding of C-CPE causes conformational 
changes in claudin-19 that would disrupt the 
tight junction in-row interactions. The mod- 
eling suggests that steric clashes between 
bound C-CPE and extracellular regions of 
neighboring claudin molecules would also 
contribute to tight junction disruption. 

What are the prospects for using the new 
structural information for drug delivery and 
development? A variety of agents, including 
C-CPE, have been used to obtain more or 
less specific paracellular leaks, often without 
detailed mechanistic knowledge. Structural 
information provided by Saitoh et al. could 
potentially be used to develop safe and more 
specific claudin-targeted agents. Indeed, C- 
CPE, generally considered a nontoxic frag- 
ment of CPE, was recently shown to cause 
liver toxicity (7). 

Improved understanding of C- CPE-medi- 
ated disintegration of the tight junction may 
also assist in the design of compounds that 
target these junctions with higher affinity 
than C-CPE; these could be delivered to tu- 
mors that overexpress claudins, such as pros- 
tate cancer (8). Conversely, compounds that 
mimic the binding site(s) in the extracellular 
domain of claudin maybe developed to block 
CPE-induced fluid accumulation and tissue 
damage caused by C. perfringens infection 
(9). Targeted suppression of claudin-5, result- 
ing in size-selective modulation of the blood- 
brain barrier, alleviates brain edema after 
traumatic brain injury in mice and provides 
proof of the concept that controlled modula- 
tion of tight junctions is possible {10). A re- 
maining challenge is to resolve the extent to 
which the different claudins interact through 
homologous subdomains or through hetero- 
geneous interactions to form tissue-specific 
tight junction barriers. Perhaps the key to 
more specific interactions can be found in 
the latter. ■ 
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INSIGHTS I PERSPECTIVES 


ATMOSPHERIC CHEMISTRY 

Just add water dimers 

Fast reaction with water dimers may limit the impact of the 
simplest Criegee intermediate on atmospheric chemistry 


By Mitchio Okumura 

» lkenes constitute a large fraction of 
the natural and human-made volatile 
organic compounds (VOCs) that are 
emitted into the troposphere. Their 
oxidation products degrade air qual- 
ity and contribute to climate warm- 
ing. Alkene oxidation is thought to involve 
Criegee intermediates (CIs), highly reactive 
molecules that form when ozone reacts with 
alkenes. However, the impact of CIs may be 
limited if they react rapidly with water. Mod- 
elers have found it difficult to quantify the 
effect of CIs on atmospheric composition, 
because laboratory data on Cl reactions with 
water have been contradictory. On page 751 
of this issue, Chao et al. (i) show that the sim- 
plest Cl, formaldehyde oxide (CH^OO), reacts 
rapidly with the water dimer, Similar 

results are reported by Lewis et al. (2). 

Alkene ozonolysis is a common reaction 
in organic chemistry (see the figure) (3). 
Alkenes colliding with ozone initially form 
a primary ozonide, which rapidly dissoci- 
ates to produce a stable ketone or aldehyde 
and a carbonyl oxide: the Criegee interme- 
diate. This highly unusual molecule is both 
very reactive and highly polar. During ozon- 
olysis, some CIs are formed hot and decay 
or react immediately, but a sizable fraction 
becomes stabilized. Scientists have long 
sought to determine the yields of stabilized 
CIs as well as their reactivity (4, 5), but the 
measurements were hampered by an inabil- 
ity to directly detect the elusive intermedi- 
ate in the gas phase. 

In 2008, Taatjes etal. reported the success- 
ful detection of a Cl— formaldehyde oxide, 
CH^OO— in the gas phase with photoioniza- 
tion spectroscopy using vacuum ultraviolet 
synchrotron radiation (d). In 2012, they 
showed that relatively high concentrations 
of CIs could be generated in a short pulse by 
photolyzing diiodomethane, to form 

CHJ, which reacts with to form CHpO 
in a gentle, nearly thermoneutral reaction 
(7). These two discoveries led to a rapidly 
growing literature that includes detection 
of larger CIs, discovery of a strong ultra- 
violet absorption band, and determination 
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of a number of reaction rate coefficients 
(S, 9). 

Experiments that directly observed CIs 
found far higher reaction rates with atmo- 
spheric species such as NO^, SO^, and or- 
ganic acids than did earlier studies. The fast 
reaction with SO^, which oxidizes very slowly 
in the atmosphere, led modelers to propose 
that CIs contribute substantially to sulfate 
aerosol formation in the troposphere (10). 

The evidence for the reaction of CIs with 
water has been contradictory. End-product 
studies done over a decade ago concluded 
that CIs are lost with high yields at high rela- 
tive humidities in environmental chambers. 
These experiments led atmospheric chem- 
ists to assume that CIs are rapidly lost at 
moderate to high relative humidities. How- 
ever, more recent laboratory experiments 
found that CIs react slowly, if at all, with 
water molecules (7, 11, 12). 


In 2004, Ryzhkov and Ariya proposed that 
CIs react not with isolated water molecules 
but rather with water dimers (13). On the ba- 
sis of quantum chemistry and statistical rate 
calculations, they predicted that the water di- 
mer would react 3.5 x 10^ times as quickly as 
the monomer. The enhancement stems from 
the cooperativity of cyclic hydrogen-bonded 
structures, which preferentially stabilize po- 


lar transition states (see the figure), an effect 
well known from water cluster studies. 

Experimental support for enhanced re- 
activity of the water dimer with CIs comes 
from experiments reported in 2014. Berndt 
et al. used a relative rate approach to show 
that CH^OO reacts with water dimers at a 
fast rate that depends quadratically on rela- 
tive humidity (14). However, their rates ex- 
ceeded upper bounds set in two experiments 
that used CH 2 I 2 chemistry (11, 12). 

Chao et al. and Lewis et al. have now suc- 
ceeded in measuring the reaction rate of the 
simplest Cl, CH^OO, with water vapor over 
a wide range of relative humidities. Both 
groups use the CH 2 I 2 chemistry and take 
advantage of the strong UV absorption spec- 
trum discovered by Beames et al. (9) to fol- 
low the decay of CH^OO in real time at much 
higher pressures and larger relative humidi- 
ties than is possible with photoionization 
detection. Both groups find that the decay 
rate depends quadratically on the water va- 
por concentration— evidence that the CIs are 
reacting with the water dimer. By using the 
known equilibrium constant for the water 
dimer, they arrive at similar rate coefficients 
for the bimolecular reaction of CH^OO with 
(H^O)^. Chao et al. recorded more precise 
data over a wider range of relative humidity. 


although systematic errors may arise from 
estimated water concentration or missing 
secondary reactions. ^ 

The new studies (1, 2) validate the oh- | 
servation of rapid water dimer reaction by § 
Berndt et al. (14), although the new rate i 
constants are smaller by a factor of 1.5 to g 
2. The results are consistent with the upper P 
bound found by photoionization detection ^ 
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Alkene oxidation. (A) Alkenes such as ethylene, emitted by both plants and human sources, are oxidized in the 
atmosphere through reaction with ozone, degrading air quality. (B) During ethylene ozonolysis, a cyclic primary ozonide 
decomposes to form the simplest Criegee intermediate (formaldehyde oxide, CH 2 OO) and formaldehyde. Two studies (1, 
2) show that CH 2 OO reacts rapidly with water dimers. The molecular model shows the predicted transition state (13). 
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(7) at very low water concentrations, but lie 
well above the upper bounds set by the two 
other experiments that used chemistry 
(iO, 11). The latter measurements inferred 
Cl concentrations by monitoring secondary 
products as proxies, which might lead to er- 
roneous results if other reactions form the 
same products. 

The observed rates are so fast that, in the 
troposphere, reaction with the water di- 
mer will be the largest sink for CRfiO and 
will limit its steady-state concentration. Of 
course, even small concentrations could con- 
ceivably still oxidize very stable species such 
as SO^. Previous simulations, such as those 
by Vereecken et al. (15), have assumed that 
water reactions are important but have been 
hampered by the large uncertainties in the 
rate coefficients. The atmospheric impact of 
this simplest of CIs can now be quantitatively 
addressed in model calculations. 

Perhaps the most critical remaining is- 
sue concerns the reaction rates of larger CIs 
with water dimers. Quantum chemistry cal- 
culations predict that larger CIs react more 
slowly than CH^OO, at least with one water 
molecule. Vereecken et at., among others, 
have predicted that the larger CIs could influ- 
ence atmospheric composition, for example, 
by causing a substantial increase in sulfate 
aerosol loading (8, 10, 15). However, the es- 
timated rates could be off by orders of mag- 
nitude. Fortunately, the same approach used 
in the current papers can be applied to the 
larger CIs; indeed, Chao et al. present pre- 
liminary data that clearly reveal at least one 
conformer of acetaldehyde oxide, CH^CHOO, 
reacting quickly with water dimers (1). The 
impact of the full range of CIs on atmospheric 
composition remains an open question, but 
one that experiments can now address. ■ 
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ORGANIC CHEMISTRY 

Harnessing weak interactions 
for enantioselective catalysis 

The traditional tools of physical organic chemistry benefit 
from modern data analysis techniques 


By Tongxiang Lu and Steven E. Wheeler 

E lucidating catal 5 rtic reaction mecha- 
nisms is often a challenge, and these 
difficulties are compounded in the 
case of enantioselective catalysts. The 
ability of a catalyst to preferentially 
form one enantiomer over the other 
often hinges on the balance of many at- 
tractive and repulsive nonbonded interac- 
tions that occur in competing transition 
states. On page 737 of this issue, Milo et al. 
(1) combine physical organic and compu- 
tational quantum chemistry with modern 
data analysis techniques to identify these 
interactions. Their predictive mathemati- 
cal models elucidate the underljdng reac- 


tion mechanism and the role of nonbonded 
interactions in these enantioselective re- 
actions, facilitating the rational design of 
more effective catalysts. 

Enantioselectivity typically arises from a 
difference in free energy between compet- 
ing transition states that lead to two pos- 
sible enantiomeric products. Traditionally, 
such free-energy differences were rational- 
ized by the destabilization of the pathway 
leading to the undesired product by repul- 
sive steric interactions. However, there has 
been recent emphasis on the role of favor- 
able noncovalent interactions in enanti- 
oselective reactions (2), and more effective 
catalysts can be designed by preferentially 
stabilizing one pathway over other, un- 
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Catalytic design cycle. Screening of an initial catalyst library yields extensive data sets, which are then distilled into 
predictive mathematical models through multivariate regressions. These models yield new mechanistic insights and 
lead to improved catalyst designs. Abbreviations: Tr, isopropyl: Ph, phenyl. 
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desired pathways (3). The benefit of such 
an approach, which is typical of enzyme 
catalysts, is that it should lead to greater 
overall cataljdic activity while retaining se- 
lectivity Moreover, recent advances in our 
understanding of noncovalent interactions, 
including 7i-stacking and cation-7i interac- 
tions, appear to have laid the groundwork 
for the exploitation of these interactions in 
rational catalyst design {4). 

However, harnessing the power of non- 
covalent interactions for enantioselective 
catalysis has proved difficult. Chief among 
the reasons is the relatively weak, nondi- 
rectional nature of these interactions, ne- 
cessitating the introduction of numerous 


interactions that must operate in concert 
to effectively stabilize the desired reaction 
pathway (5). Rationally designing cata- 
lysts that achieve such coordinated effects 
is fraught with difficulties. Indeed, even 
identifying the noncovalent interactions re- 
sponsible for selectivity in existing catalytic 
reactions, which is a prerequisite for ratio- 
nal catalyst design, is often not straightfor- 
ward based only on experimental data. 

Computational quantum chemistry, in 
which quantum mechanics is used to pre- 
dict molecular properties by describing the 
electronic motion, has proved invaluable for 
understanding chemical reactions and even 
designing catalysts (5, 6). It is routinely used 
to understand enantioselectivities by pre- 
dicting the structures and energies of the 
operative transition states, while also quan- 
tifying the impact of noncovalent interac- 
tions on these structures. Unfortunately, for 
many catalytic reactions, there are simply 
too many potential transition-state struc- 
tures (possibly hundreds) for such analyses 
to be practical. For example, in noncovalent 
catalysis, the catalyst and substrate can in- 
teract in a m 3 Tiad of ways, and many such 
transformations are not amenable to com- 
putational study with this direct approach. 

Milo et al. have effectively circumvented 
this problem by providing a means of pre- 
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dieting enantioselectivities based on simple 
molecular descriptors (such as vibrational 
frequencies and dipole moments) that char- 
acterize the reactants (7). These data are 
easily obtained, obviating the need to com- 
pute all possible transition states for each 
catalyst-substrate combination. Moreover, 
they showed that classical physical organic 
techniques can be effectively combined 
with modern data analysis tools to yield 
insights into the mechanisms of catalyzed 
reactions and the role of noncovalent inter- 
actions in enantioselectivity. 

To demonstrate the power of their ap- 
proach, Milo et al. tackled a particular ex- 
ample of chiral anion catalysis, in which 
enantioselectivity is induced by 
the noncovalent association of 
a cationic intermediate with a 
chiral, anionic catalyst (see the 
figure) (8, 9). To understand 
these reactions, they synthe- 
sized and tested a library of cat- 
alysts exhibiting a broad range 
of enantioselectivities. These 
experiments provided a wealth 
of data regarding the impact of 
steric and electronic factors on 
enantioselectivity, which was 
then distilled into predictive 
mathematical models through multivariate 
regressions. These models unveiled subtle 
factors that control the enantioselectivity 
of these reactions and, ultimately, lead to 
the design of better catalysts. 

By embracing modern data analysis tech- 
niques to enhance the more traditional 
tools of physical organic chemistry, Milo 
et al. have provided a way to harness the 
power of noncovalent interactions for the 
design of enantioselective catalysts. Impor- 
tantly, their approach is general and should 
be applicable to a wide range of catalytic 
reactions. This expands the power of the 
simple linear free-energy relations that 
have long been the workhorse of physical 
organic chemistry, and provides a key step 
toward a future in which big data can be 
used to design small catalysts. ■ 
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“To demonstrate the power of their 
approach, Milo et al. tackled a 
particular example of chiral anion 
catalysis, in which enantioselectivity 
is induced by the noncovalent 
association of a cationic intermediate 
with a chiral, anionic catalyst.’’ 


HEALTH CARE POLICY 

Randomize 
evaluations 
to improve 
health care 
delivery 

Administrative data and 
experimental designs lead 
the way 

By Amy Finkelstein^’^’®^ 
and Sarah Taubman^ 

T he medical profession has long recog- 
nized the importance of randomized 
evaluations; such designs are com- 
monly used to evaluate the safety and 
efficacy of medical innovations such 
as drugs and devices. Unfortunately, 
innovations in how health care is delivered 
(e.g., health insurance structures, interven- 
tions to encourage the use of appropriate 
care, and care coordination approaches) are 
rarely evaluated using randomization. We 
consider barriers to conducting 
POLICY randomized trials in this setting 
and suggest ways for overcoming 
them. Randomized evaluations of fundamen- 
tal issues in health care policy and delivery 
should be— and can be— closer to the norm 
than the exception. 

There is particular interest in improving 
delivery of health care in the United States, 
where the health care sector accounts for 
almost one-fifth of the economy. The newly 
created Patient-Centered Outcomes Re- 
search Institute is providing an estimated 
$3.5 billion in research grants, and the lat- 
est round of Center for Medicare and Med- 
icaid Innovation Health Care Innovation 
Awards provides about $1 billion in research 
grants— much of it aimed at improving the 
delivery of US. health care. 

Studies of US. health care delivery typi- 
cally rely on a range of observational and 
quasi-experimental methods. These can be 
extremely valuable for learning as much as 
possible from existing historical data and for 
studjdng questions that are not amenable to 
randomized designs. For prospective evalua- 
tion of new interventions, however, it is often 
possible to use a randomized design without 
adding substantially to the cost or difficulty 
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of the study. When feasible, randomized 
designs have an unparalleled ability to pro- 
vide credible evidence on an intervention’s 
impact. This can be seen in the outsized 
and enduring influence of the 1970s RAND 
Health Insurance Experiment, a randomized 
evaluation of the impact of health insurance 
in the United States (1, 2). More recently, the 
attention paid to the 2008 Oregon Health In- 
surance Experiment (OHIE), a randomized 
evaluation of the impact of Medicaid (3-6), 
underscores the continued power and influ- 
ence of such randomized evaluations in both 
the academy and public discourse. 

To explore how commonly randomization 
is used in health care delivery studies, we 
examined papers published in a limited set 
of top journals in medicine, economics, and 
health services between 2009 and 2013 [see 
(7) for details on data and methods]. We 
included papers designed to study causal 
effects of an intervention (using either ran- 
domized or other methods). We focused on 
a handful of top journals to capture an illus- 
trative set of high-profile studies; the pic- 
ture may be different across all published 
(and unpublished) studies. We did, how- 
ever, observe similar patterns in reviews of 
trials registered with clinicaltrials.gov and 
of reports from major contract research 
organizations (7). 

On average, 18% of studies of US. health 
care delivery interventions used randomiza- 
tion (see the table). By comparison, 79% of 
studies of US. medical interventions were 
randomized (P-value for comparison < 
0.001). Medical studies involving drugs were 
very likely to be randomized (86%), but ran- 
domization was also common in nondrug 
medical studies (66%). 

Of course, regulatory and funding environ- 
ments in medicine are quite different from 
those in the social sciences. However, we 
found several areas of social science where 
randomization is used far more than in 
health care delivery. In US. education stud- 
ies in top economics journals, 36% were ran- 
domized (P-value for comparison = 0.028). 
More notably, 46% of international develop- 
ment studies in top economics journals were 
randomized (P < 0.001). Even within health 
care delivery, there appears to be more use of 
randomization internationally than within 
the United States. Looking across the same 
journals in medicine, economics, and health 
services as above, 41% of health care deliv- 
H ery studies conducted outside of the United 

1 States were randomized, compared with 18% 
S in the United States (P < 0.001). 
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DATA AND DESIGN. To understand why ran- 
domized trials in US. health care delivery 
have been rare, we turn to some of the chal- 
lenges in conducting such studies. We then 
propose practical approaches to managing 
these challenges. 

We begin with potential ethical consider- 
ations. For medical innovations, randomized 
trials are considered essential in determin- 
ing both safety and efficacy. In health care 
delivery, safety concerns tend to be less 
strong. However, there is often equipoise 
regarding effectiveness. Moreover, it is com- 
mon in health care delivery for promising 
programs to reach only a small fraction of 
the individuals who might benefit. Where 


there are capacity constraints, random as- 
signment can be the most equitable way to 
allocate limited slots. Indeed, the random 
selection used in the OHIE was designed by 
the state in conjunction with stakeholders 
specifically to address fairness concerns (8). 

Another common concern is that ran- 
domized evaluations are prohibitively 
costly, but this does not have to be the case. 
It is true that the typical model for ran- 
domized controlled trials in medicine is 
expensive and time-consuming. Screening, 
recruiting, and obtaining informed consent 
from individual patients before randomiza- 
tion and then collecting follow-up data for 
the purposes of the study is labor-intensive 
and difficult. Historically most random- 
ized trials of health care delivery innova- 
tions have followed this model. Our review 
of randomized studies of US. health care 
delivery published between 2009 and 2013 
in top medical journals (7) found that 80% 
recruited and requested consent from indi- 
viduals and 85% collected primary data. 


A corollary of this labor-intensive approach 
is that randomized evaluations frequently 
focus on very specific patient populations. 
Of the 31 randomized health care delivery 
studies from top medical journals included 
in the table, 77% were convenience samples 
(for example, patients at a single hospital). 
This raises important concerns about their 
generalizability. 

This expensive, time-consuming, and 
convenience-sample approach may be nec- 
essary in most medical trials, where there 
are often real risks to participants. However, 
in most health care delivery interventions, 
there is usually only minimal risk of harm to 
participants. As a result, an alternative ap- 


proach to randomization can produce valid 
causal estimates at substantially reduced 
cost. Randomization is done, with a waiver of 
informed consent, on a set of potentially eli- 
gible individuals, and those who are random- 
ized into the treatment group are offered the 
intervention. All individuals included in the 
random assignment— including those who do 
not accept the offer of the intervention— are 
followed. Low take-up of the program (adher- 
ence to the assigned protocol) does not inter- 
fere with obtaining consistent estimates of 
the program’s causal effects (9). This type of 
randomization design was used in the OHIE; 
those randomly selected were offered Medic- 
aid applications, which allowed us to study 
the impact of Medicaid coverage. 

Although this approach reduces the sta- 
tistical power of the study, it is compatible 
with running large trials and trials with more 
representative samples, because it does not 
require individual recruitment, and individu- 
als can be followed passively in administra- 
tive data. Data collected, used, and stored for 
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Not as many randomized studies as there could be 


TOPIC 

NUMBER OF 
STUDIES 

NUMBER 

RANDOMIZED 

PERCENT 

RANDOMIZED 

U.S. health care delivery 




Top medical journals (subsample) 

62 

31 

50 

Top economics journals 

13 

2 

15 

Top health services journals 

405 

13 

3 

Adjusted average* 



18 

Comparison to medicai innovations in top medical journals 


Medical treatment (U.S. based) 

176 

139 

79 

Medical treatment (international) 

177 

136 

77 

Comparison to other topics in top economics journais 

** 


All other U.S. -based 

192 

14 

7 

Public finance 

50 

4 

8 

Industrial organization 

48 

0 

0 

Labor 

31 

2 

6 

Education 

22 

8 

36 

Other 

57 

4 

7 

International development 

37 

17 

46 

Heaith care delivery studies conducted outside of the U.S. A. 


Top medical journals (subsample) 

30 

19 

63 

Top economics journals 

4 

1 

25 

Top health services journals 

53 

2 

4 

Adjusted average* 



41 


Use of randomization by study topic. The table includes all empirical papers designed to study causal effects of an 
intervention and published in top journals in three fields. Medical: four randomly selected months per year 2009-2013 
for New EnglandJournal of Medicine, Journal of the American Medical Association, Annals of Internal Medicine, and PLOS 
Medicine. We excluded BMJ and Lancet after a preliminary investigation of 4 months of publications found no studies of 
U.S. health care delivery in either journal. Economics: 2009-2013 in the/Amer/can Economic Review, Quarterly Journal of 
Economics, Journal of Political Economy, and Econometrica. Health services: 2009-2013 in Health Affairs, Medical Care, 
and Milbank Quarterly. *The average adjusts for the fact that we reviewed only 20 out of 60 months of medical journals but 
all issues of economics and health services journals. Medical journals typically published more frequently and provided 
ample articles to provide a good estimate, which was then upweighted in the total. **Economics papers may be coded as 
having more than one topic and would contribute to each. See (7) for details. 


reasons other than the study— such as from 
insurance claims, hospital discharges, elec- 
tronic medical records, employment records, 
and mortality records— often include a vir- 
tual census of the relevant individuals. These 
data allow researchers to examine a wide 
range of impacts at substantially lower cost 
than primary data collection. 

Compared with surveys, such administra- 
tive data offer several additional advantages 
besides lower cost. They are less likely to suf- 
fer from bias due to differential nonresponse 
or attrition. They can provide close-to-real- 
time results on the impact of an intervention. 
They can also be used for following up on 
long-term outcomes of the intervention [e.g., 
the impact of kindergarten classes on adult 
earnings in the Project STAR study (iO)]. By 
combining the alternative randomization 
approach with follow-up in administrative 
data, randomized evaluations can be made 
no more costly than the prospective observa- 
tional evaluations that are commonly done in 
U.S. health care delivery. 


A final set of challenges revolves around 
the ways individuals and systems interact. 
Some of the most promising ideas for U.S. 
health care delivery interventions involve 
reforms to entire systems of care. Cluster- 
randomized designs can be a useful tool 
here. Some system-level or comprehensive 
interventions may even be amenable to pa- 
tient-level randomization. For example, inno- 
vations such as including bundling payments 
for episodes of care and creating shared 
saving contracts— major themes in current 
health policy discussions— are often held 
up as examples of something hard to study 
through randomized evaluation. Yet as these 
payment mechanisms expand to take on new 
groups of patients, randomizing which indi- 
viduals are included may be possible. 

Of course, some interventions— including 
individual-level interventions— can have sys- 
tem-wide effects if implemented on a large 
scale. Consider the expansion of insurance 
coverage. A randomized study like the OKIE 
allows us to detect effects of covering a given 


individual with insurance, while holding the 
general health care environment constant. 
However, capacity constraints in the health 
care system may limit effects of market- 
wide expansions, particularly in the short 
run; alternatively, as suggested by quasi- 
experimental work (11), provider responses 
to a market-wide insurance expansion— such 
as adoption of new medical technology and 
changes in practice style— may amplify ef- 
fects of market-wide expansions. In some 
cases, it is possible to design studies that look 
at these broader effects, by randomizing the 
proportion of individuals within the relevant 
unit who are assigned the treatment, as well 
as randomizing which individuals within the 
unit are assigned the treatment (12). In other 
cases, however, such approaches are not 
practical or feasible, and we need to draw on 
other methods. 

More generally, this discussion highlights 
the value of experiments that are actively 
designed by researchers to shed light on 
specific mechanisms. The OKIE was not pro- 
spectively designed by researchers (8) and, as 
a result, leaves much to be debated, such as 
whether an alternatively designed Medicaid 
program could achieve most of the benefits 
but at lower cost [see, e.g., (13)]. By contrast, 
the RAND Health Insurance Experiment 
was prospectively designed by researchers 
to shed light on tradeoffs involved in cost- 
sharing. It used multiple arms to randomly 
vary cost-sharing features of health insur- 
ance that individuals received. It has been 
widely used in policy and academic discus- 
sions of optimal cost-sharing designs. 

Governments, insurers, employers, and 
health care providers are experimenting with 
a wide variety of innovations intended to 
improve health and reduce costs. Increased 
use of randomized evaluations offers a fea- 
sible way to more rigorously measure their 
efficacy and accelerate the pace at which we 
improve the health care delivery system. ■ 
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Botanical brilliance 

Are plants decision-makers 
or elaborate fakers? • 


By Andrew G. Zink* and Zheng-Hui He 

I n Plant Behaviour and Intelligence, 
Anthony Trewavas challenges us to 
leave behind our prejudices and view 
the world from a plant’s perspective. 
Plants, he argues, behave on their own 
time scale, with their own unique phys- 
iology, and solve problems that are equally 
as complex as those confronting animals. 
This book represents a treasure trove of 
fascinating case studies and has the poten- 
tial to serve as an important resource for 
plant physiologists and behavioral ecolo- 
gists alike. 

Behavioral ecologists have long rec- 
ognized the possibility of appl 5 dng their 
models of animal behavior to the plant 
kingdom. The journal Behavioral Ecology 
has invited papers on plant behavior since 
its inception in 1990, but it took more than 


two decades for the first (and only) paper 
on plant behavior to be published (i). The 
journal Plant Signaling and Behavior was 
created in 2006 to bridge this plant-animal 
divide, but so far it has focused almost en- 
tirely on specific mechanisms rather than 
evolutionary adaptations. 

Bridging the worlds of plant and animal 
behavior remains a conceptual (and politi- 
cal) challenge. Trewavas has been persis- 
tent and courageous in his defense of plant 
behavior for many years and maintains his 
position in this book, reasonably assert- 
ing that any “response of an organism to 
a specific stimulus” can be defined as a 
behavior, especially if it is tied to survival 
and reproduction. 

The scale of plant behaviors described 
in this book ranges from individual cells 
to tissues and organs to entire plants. This 
is likely to surprise (and possibly disori- 


Plant Behaviour and 
Intelligence 

Anthony Trewavas 
Oxford University Press, 
2014. 303 pp. 



m 


ent) many behavioral ecologists, who tend 
to reserve the term “behavior” for entire 
organisms. Although Trewavas’s analogies 
would have been much simpler to com- 
prehend if he had focused exclusively on 
whole plants, some of his best are drawn 
between the behaviors of animals and those 
of plant organs. Such unexpected compari- 
sons force us to reconsider our very notion 
of individuality. 

In one example, Trewavas proposes that 
leaves “behave” when responding to quanti- 
ties and qualities of light. In another exam- 
ple, he proposes that roots “behave” when 
they forage along key environmental gradi- 
ents such as gravity, water, and mineral nu- 
trients. But in both of these cases, as well as 
in the many similar analogies, we were left 
wondering if the entire plant is “behaving” 
by guiding a peripheral tissue or if these 
plant tissues are operating as autonomous 
behavioral entities. If the latter is true, how 
might disagreements among plant tissues 
be resolved, and how might this translate 
into what Trewavas calls “intelligent” be- 
havior by the whole plant? 

Most philosophers would be quite skep- 
tical of Trewavas’s claim that plants have 
true intelligence and would have even 
stronger objections to the claim that plants 
exhibit a form of consciousness, as he 
suggests in the book’s final chapters. But 
Trewavas views intelligence as an emergent 
phenomenon of the entire plant, arising 
from coordinated behaviors of individual 
tissues. He argues that this is analogous 
to problem-solving by social insect colonies 
through coordination among workers. This 
is where this book becomes the most chal- 
lenging but also the most exciting in terms 
of generating new ideas. 

In Chapter 11, for example, Trewavas 
purports to show that the cambium layer 
of a tree acts as an “integration assessor,” 
quantifying the productivity of individual 
branches in a way that is similar to how | 
a honeybee colony integrates information ^ 
from various waggle dances by worker | 
bees. He views this as a competitive process ^ 
in which the cambium layer preferentially < 
distributes resources to the most produc- | 
tive branches. We would point out that this ^ 
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MEDICINE 

Stories from the heart 

A history of cardiac research and therapies 


cooperation among plant tissues is likely 
facilitated by the absence of genetic con- 
flicts that are much more common in social 
insect colonies. One notable exception to 
this would be the phenomenon of multiple 
paternity of seeds in a single plant. In Chap- 
ter 17, Trewavas suggests that game theory 
could be used to address seed competition; 
one can easily imagine seeds within and 
across fruits competing for resources, much 
like chicks in a nest begging for food. 

Although admirable in scope and ambi- 
tion, there are a few places in which Trewa- 
vas’s analogies fall short. For example, 
individual workers within social insect 
colonies can easily change tasks and move 
their positions within the colony. Plant cells 
and tissues are static due to their cell walls. 
Therefore, any decision-making behavior 
by the whole plant can only occur through 
growth and decay. This unique constraint 
found in plants is what leads Trewavas to 
claim, “Behaviour in plants is most eas- 
ily seen as changes in form.” We were left 
asking whether plant “behavior” is simply 
prolonged developmental plasticity or if ju- 
dicial self-pruning and regeneration repre- 
sent a true strategy. 

As we read this book, we wondered 
whether the series of analogies with animal 
behaviors sidelined the unique and aston- 
ishing ability of plants to exhibit behaviors 
(e.g., regeneration) that most animals could 
never achieve. Because plants cannot run 
away from danger, for example, they have 
evolved defenses against pathogens and 
herbivores that rival and even exceed the 
sophistication of many animal immune sys- 
tems. One gets the sense that Trewavas, too, 
is frustrated that animal biologists have 
neglected the unique sophistication and 
problem-solving abilities of plants, but it is 
possible that his animal-centric analogies 
may play an unintended role in perpetuat- 
ing this bias. 

There is no doubt that using terminol- 
ogy from the field of animal behavior to 
describe plant behavior may to some de- 
gree downgrade the amazingly unique 
things that only plants can do. However, 
we believe that applying the rich theo- 
retical perspectives of behavioral ecol- 
ogy to plants can only help scientists to 
appreciate and better understand the evo- 
8 lutionary significance of plant behavior. 
§ And at the very least, this book will likely 
g inspire a bit more respect for a kingdom 
I of master problem-solvers who happen to 
a march to the beat of their own (very slow) 
I drum. 
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By Yevgeniya Nusinovich 

S ince prehistoric times, people have 
known that the heart is critical for 
sustaining life, but its true func- 
tion was not well understood until 
recently. The roles traditionally as- 
cribed to the heart included being 
the seat of the mind, the organ of love, and 
the location of the soul. An understanding 
of the heart’s true role in the circulatory 
system took much longer to emerge. In The 
Man Who Touched His Own Heart, evolu- 
tionary biologist Rob Dunn takes read- 
ers on a meandering journey through the 
evolution of human knowledge about this 
mystical organ and ways to treat its many 
maladies. 

Dunn uses his own mother’s recent car- 
diac problems to set the scene for the story 
and, presumably, explain 
his own sudden interest in 
matters of the heart. From 
there, he begins with the 
improbable story of the 
first documented heart 
surgery, performed by an 
African American doctor 
in an understaffed and un- 
derfunded hospital in Chi- 
cago in 1893, before going 
back to the time of Galen, a 
prominent Greek physician 
and philosopher who was 
among the first to attempt 
to understand the heart’s function in the 
ancient world. 

Subsequent events are discussed more 
or less chronologically, with the excep- 
tion of those that occurred in the 20th 
century, when so much was being learned 
and done in parallel. The majority of the 
book is devoted to the events that led to 
the development of cardiac imaging, medi- 
cal treatments for atherosclerosis, and, of 
course, heart surgery itself. 

Among these stories is the tale of the 
German physician Werner Forssmann, who 
inspired the book’s title. Forssmann’s bold 
experiment on his own heart, which I won’t 
spoil here, paved the way for a number of 
new diagnostics and interventions and 
ultimately earned him the Nobel Prize in 
Medicine. 


The reviewer is on staff at Science Translational Medicine, 
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The writing in this book is clear and un- 
derstandable, without unnecessary use of 
medical jargon and with detailed explana- 
tions of all relevant terms. However, it does 
occasionally stray into the overly lyrical and 
cutesy— for example, when Dunn describes 
the heart as “the Mount Everest of the 
body” or “that bloody engine.” Additionally, 
the heavy use of foreshadowing for some 
of the more dramatic events in the story 
sometimes makes the book 
feel like a television drama; 
however, on the whole this 
should not detract much 
from the reader’s enjoy- 
ment. Even when I already 
knew how everything was 
going to turn out (both 
from personal knowledge 
and from the foreshad- 
owing), I found the sto- 
ries to be well told and 
compelling. 

The best part of the 
book, in my opinion, and 
the one that’s likely to be the least familiar 
to the majority of readers, deals with the 
evolution of the circulatory system across 
a range of species, from sea sponges to hu- 
mans. This seems to be the author’s favorite 
topic area, and he writes about it without 
the dramatic flair of his surgical stories but 
with clear love for the subject. 

Despite having been written for a gen- 
eral audience, this book would make great 
reading for medical professionals, as it 
features an extensive collection of medical 
anecdotes and fascinating history (includ- 
ing the occasional bar fight, which is appar- 
ently a common way for someone to end up 
needing emergency heart surgery). I would 
recommend it to anyone who is interested 
in the heart, in medical history, or in dra- 
matic and improbable stories from the field 
of medicine. 
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Edited hy Jennifer Sills 

Cancer risk: 

Role of environment 

THE REPORT “VARIATION in cancer 
risk among tissues can be explained by 
the number of stem cell divisions” (C. 
Tomasetti and B. Vogelstein, 2 January 
p. 78) is dangerously misleading because 
it understates the role of prevention in 
cancer causation. It is widely acknowl- 
edged that many cancers can be explained 
by a two-step process: initiation by one 
or a series of mutations, followed by the 
promotion of the genetic “mistake” to a 
recognizable tumor or blood disease (i). The 
observation that replication of the mistake 
may proceed at different rates in different 
tissues is no doubt correct. However, some 
mutations are initiated by chemical or 
viral exposures, and others occur without a 
known cause. 

Promotion of DNA damage to recog- 
nizable disease occurs in both cases. The 
conclusion that “stochastic effects of DNA 
replication can be... distinguished from 
external environmental factors” is an 
inaccurate statement that rests on a false 
dichotomy. An environmental influence can 
in fact create a DNA change which, if pres- 
ent when the DNA is copied, is subsequently 
“fixed” into the genome as a permanent 
change. The more replications, the less time 
there is for DNA repair to take place before 
the next copying/fixation event. Thus, the 
correlation between frequency of copying 
events and lifetime cancer risks among 
tissues does not imply that environmental 
influences play a lesser role in the causa- 
tion of those same mutations. The fact 
that age-adjusted cancer rates for different 
tissues vary substantially among countries 
where statistics are kept, and between 
workplaces or communities that differ in 
environmental exposures, demonstrates that 
a large fraction of cancers are influenced by 
environmental factors (2). 

What the authors’ work suggests is that 
stochastic differences in effects of DNA 
replication on cancer occurrence in different 
tissues can be distinguished from effects 
of external environmental factors. This 
distinction is far from trivial. Furthermore, 
the conclusion that “[t]he concept underly- 
ing the current work is that many genomic 
changes occur simply by chance during 
DNA replication rather than as a result of 
carcinogenic factors” ignores the fact that 
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an initiation event must have taken place 
for a mutation to be replicated. The paper 
obscures the distinction between differences 
in cancer incidence and differences in occur- 
rence of initiating events leading to cancer. 
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Cancer risk: Tumors 
excluded 

IN THEIR REPORT “Variation in cancer 
risk among tissues can be explained by the 
number of stem cell divisions” (2 January 
p. 78), C. Tomasetti and B. Vogelstein 
discuss an interesting correlation (0.804) 
between estimated lifetime stem cell 
division number in 31 tissue types and 
corresponding cancer incidence rates in 
the United States. However, their assertion 
that only 35% of cancer risk variation is 
due to environmental or genetic factors is 
problematic. 


The correlation analysis excluded many 
cancers (such as stomach, breast, prostate, 
cervix, kidney, endometrium, bladder, 
and lymphoma) that are common in the 
United States or worldwide, so no state- 
ment about overall cancer rate variation 
that is “explained” by stem cell divisions 
can be made. Furthermore, the correlation 
was anchored by five data points for osteo- 
sarcoma and included tumor subtypes 
having genetic (colorectal) and environ- 
mental influences (lung), but stem cell 
division rates were not estimated sepa- 
rately for organ subtypes. There are strong 
time trends in cancer incidence rates and 
large incidence-rate variations interna- 
tionally for nearly all cancer t 5 q)es [for 
example, the rate of squamous esophagus 
cancer among men with the high- 
est incidence (Jiashan County 
in China and African 
Americans in South 
Carolina) is more than 
100 times the rate 
among men with 
the lowest inci- 
dence (Algeria) 
(i)]. If interna- 
tional rates were 
added to Figure 
1, a much smaller 
fraction of inci- 
dence rate variation 
would be explained 
by stem cell divisions. 
Moreover, as the authors 
note, “The total number 
of stem cells in an organ and 
their proliferation rate may of course be 
influenced by genetic and environmental 
factors,” so that stem cell division numbers 
could serve, substantially, as a mediator 
of genetic and environmental influences, 
rather than a distinct etiologic factor. 

Finally, high values of the authors’ extra 
risk score (ERS) are described as arising 
when “there is high cancer risk relative 
to the number of stem cell divisions,” but 
ERS is calculated not as the ratio, but as 
the product, of cancer incidence rates 
and stem cell division number. Hence 
the resulting classification into D and R 
tumors does not seem interpretable and, 
regardless, could aim only to identify 
tumors that have etiologic mechanisms 
other than stem cell division number. 
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Cancer risk: Role of 
chance overstated 

THE RECENT ASSERTION from C. Tomasetti 
and B. Vogelstein that most variation in 
cancer risk among tissues is due to “bad 
luck” demands close consideration, espe- 
cially as they go on to argue for increased 
focus on early detection (“Variation in can- 
cer risk among tissues can be explained by 
the number of stem cell divisions,” Reports, 
2 January, p. 78). Observations from cancer 
epidemiology and limitations in their 
analysis argue strongly against this con- 
clusion. Most cancers show considerable 
differences in incidence rates between dis- 
tinct populations. Rates change over time, 
and migrants soon exhibit incidence rates 
similar to their host country. Each of these 
is consistent with a major etiologic role for 
environment and lifestyle. Consequently, a 
majority of cancers are preventable, with 
primary prevention achieving notable suc- 
cesses and promising more (i). 

In their analysis, the authors correlate 
total stem cell divisions in selected organs 
or sites, and lifetime risk of a particular 
cancer at those sites. There is much uncer- 
tainty in the estimates of total stem cell 
divisions for each cancer site, and the vast 
age-related fluctuations in cell division for 
some tissues are overlooked. Of 
greater concern is the life- 
time risk of cancers. Their 
analysis excludes fre- 
quent cancers with 
major environmen- 
tal causes (such as 
stomach, breast, 
and cervix) and 
oversamples 
cancers rare in 
all populations 
(such as osteo- 
sarcomas, small 
intestine, and 
medulloblastoma). 

Overall, the cancer 
sites included account 
for only 34% of the cancer 
cases in the United States (2). 

The choice of the US. population is also 
arbitrary. A different population with 
different cancer patterns would have pro- 
vided different results. 

We also take issue with the statistical 
analysis. Despite the reported correla- 
tion of 0.81, stem cell replication is a poor 
predictor of incidence rates at any given 
cancer site. The residual standard devia- 
tion of the log rates is 0.75, so the 95% 
confidence limits for the log rate of any 


cancer site are given by the linear predic- 
tor ±1.47 (i.e., 1.96 x 0.75). Converting from 
a loglO scale to an absolute scale gives an 
error factor of 10^ ^^=29.4; i.e., the incidence 
rate may be 30 times higher or 30 times 
lower than the value predicted by stem cell 
division rates alone. This residual variation 
is consistent with large effects of environ- 
mental and lifestyle factors. 

The role of chance underl 3 dng the onset 
of any individual cancer has long been 
recognized (3). However, although impor- 
tant for the individual, chance has little to 
say about the incidence rate in a popula- 
tion, or differences between populations. 
These are far better explained by exposure 
to environmental and lifestyle factors, 
allowing important opportunities for, and 
supporting implementation of, primary 
prevention. 
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Cancer risk: 
Prevention 
is crucial 

AS CANCER 
prevention sci- 
entists, we read 
C. Tomasetti and 
B. Vogelstein’s 
Report “Variation 
in cancer risk 
among tissues can 
be explained by the 
number of stem cell 
divisions” (2 January, 
p. 78) with considerable 
interest. Many of the findings 
support previous research: Cancers vary in 
preventability and the cancers that cause 
the most mortality in developed countries 
(lung and colon) are highly preventable 
(i). However, other findings in the Report 
do not reflect the current evidence. 

For example, many of the “R-tumor” 
type cancers that the authors h 3 q)othesize 
to be unlikely to be preventable have 
well-known modifiable risk factors, such 
as tobacco and alcohol use for esophageal 


and head and neck cancers, radon expo- 
sure for lung cancer in nonsmokers, and 
ultraviolet light exposure for melanoma 
(Jf). There is also well-documented varia- 
tion in cancer incidence rates for these 
and other cancers, globally and due to 
migration, as well as over time (7). These 
kinds of changes do not seem to be com- 
patible with the theory that these cancers 
originate primarily from random stem cell 
mutations. 

Tomasetti and Vogelstein found 
an interesting statistical relationship 
between rates of stem cell division and 
cancer rates in selected tissues, but they 
overinterpret the results by impljdng 
a causal relation. Emerging evidence 
suggests that stem cell division rates, 
and errors in division, are not simply a 
product of time and chance; they vary due 
to many external influences, including 
obesity, environmental pollution, infec- 
tions, and inflammation (2, 3). 
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Cancer risk: Many 
factors contribute 

IN THEIR REPORT “Variation in cancer 
risk among tissues can be explained by the 
number of stem cell divisions” (2 January, 
p. 78), C. Tomasetti and B. Vogelstein found 
a high correlation between the number of 
lifetime stem cell divisions of a given tissue 
and the lifetime risk of cancer in that tis- 
sue. Based on the finding that 65% of the 
variation in cancer risk among different 
tissues can be explained by the number 
of stem cell divisions in those tissues, 
the authors concluded that “these results 
suggest that only a third of the variation 
in cancer risk among tissues is attribut- 
able to environmental factors or inherited 
predispositions.” This conclusion presumes 
that the total contribution of different 
components to variation in cancer risk 
among tissues adds up to 100%. However, 
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most cancers are caused by multiple 
overlapping factors, and the attribut- 
able fractions for individual factors can 
add up to more than 100%. Furthermore, 
Tomasetti and Vogelstein suggest using the 
extra risk score (ERS) to direct allocation 
of primary versus secondary prevention for 
different cancers. However, although the 
ERS indicates how important the stochas- 
tic effects of DNA replication are for the 
variation in cancer rates across organs, it 
does not inform about the preventability 
of a certain cancer in the population. As 
shown in Figure 1 in the Report, a wide 
variation in cancer rates exists even within 
highly proliferative tissues, indicating a 
substantial role of non-stochastic factors 
in carcinogenesis (such as sun exposure for 
melanoma, tobacco for lung cancer, viruses 
and obesity for hepatocellular carcinoma, 
and obesity and tobacco for pancreatic 
ductal adenocarcinoma) and an enormous 
potential for primary prevention. The 
proportion of cancer cases that can be 
potentially prevented by environmental 
(mainly lifestyle) modification should be 
estimated on the basis of the comparison 
of cancer rates across populations with 
different risk factor profiles (1, 2), rather 
than the comparison of cancer rates across 
tissues within individuals. 
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Cancer risk: Accuracy 
of literature 

WE READ WITH INTEREST the recent 
cancer etiology Report “Variation in cancer 
risk among tissues can be explained by 
the number of stem cell divisions” (2 
January, p. 78), in which C. Tomasetti and 
B. Vogelstein claim that most cancer risk 
can be explained by chance mutations. 
However, the selection criteria used for 
cancer types included in the study are not 
robust. First, the authors report using an 
“extensive literature search” to identify 
eligible tissue types. There is no evidence 
that a systematic literature review was 
conducted. Second, the assessment of 
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literature quality and subsequent inclusion 
criteria is not clear. According to the 
authors, “Other cancer types could not be 
assessed, largely because details about the 
normal stem cells maintaining the tissue 
in homeostasis have not yet been agreed 
upon or accurately quantified.” There have 
been volumes written about the necessity 
of systematic literature reviews and 
subsequent appraisal as a criti- 
cal component of obtaining 
accurate and unbiased 
research results (i). 

The method used 
by Tomasetti and 
Vogelstein leads to the 
exclusion of breast 
and prostate cancer, 
together accounting 
for -25% of all newly 
diagnosed cancers (2). No 
doubt other cancer types are 
excluded as well. Breast and 
prostate cancer have been closely 
studied, in many cases to a much greater 
extent than those cancers that the authors 
select. Lack of agreement regarding 
accurate quantification of these cell types 
should be addressed by sensitivity analysis 
rather than exclusion. Large bodies of 
literature will invariably contain disagree- 
ment between authors. This is hardly 
justification for exclusion. 
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Response 

THESE LETTERS CONTINUE the healthy 
and intelligent debate among scientists 
and the public about the root causes of 
cancer and the best way to reduce cancer 
deaths. The debate hinges on the follow- 
ing question: What causes the mutations 
that are responsible for cancer? Two 
causes— environmental and hereditary fac- 
tors— have long been recognized. A third 
cause— mutations that arise during normal 
stem cell divisions in the absence of exog- 
enous factors— was also known, but there 
had been no way to measure the relative 
importance of these mutations in cancers 
and compare them to the other causes. 

Our analysis enabled such a measurement, 
and we found evidence for a surprisingly 


large role of these mutations, henceforth 
called replicative mutations. 

Suppose we had discovered a muta- 
genic, industrial agent that was present 
in human tissues at concentrations that 
were very highly correlated with can- 
cer incidence. The implications of this 
discovery would be obvious. But such 

an imaginary discovery is highly 
analogous to the one reported 
in our paper. The difference 
is that the “agent” is not 
exogenous. Replicative 
mutations are unavoid- 
able. They are in a 
sense a side-effect of 
evolution, which can- 
not proceed without 
them. That they play 
a larger role in cancer 
than previously believed 
has important scientific and 
societal implications. 

At least three reactions to our 
paper have emerged. To some, the idea 
that we cannot completely control our 
cancer destinies by living a perfect life- 
style in a perfect environment, even when 
we have no hereditary predisposition to 
cancer, has proved unsettling. To oth- 
ers, our paper had a completely different 
message. That a child has cancer is bad 
enough; that a parent may feel guilty 
for failing to avoid a certain life-style or 
environment, and thereby “causing” that 
cancer, is agonizing. We chose to use the 
word “bad luck” particularly because we 
were aware of the unjustified guilt felt by 
many patients and their families about 
cancers that were beyond their control. 

The third reaction is fear that recognition 
of a major role for “bad luck” in cancer 
could lead individuals to conclude that 
all t 5 q)es of cancer are unpreventable and 
there is nothing they can do to avert any 
of them. We and many others, including 
those who have written Letters to Science, 
have vigorously campaigned against this 
mistaken belief (1, 2). 

Ashford et at. state that “some mutations 
are initiated by chemical or viral exposures, 
and others occur without a known cause,” 
leaving open the possibility that other 
mutations are caused by chemical or viral 
exposures that have not yet been identified. 
The views of Ashford et al. stem from influ- 
ential studies carried out in 1947 in which 
mice were treated topically with a single 
dose of a strong mutagen (i.e., initiator), 
followed by repeated topical doses of croton 
oil (i.e., promoter) (3). Ashford et al. thus 
state that our study “ignores that fact that 
an initiation event must have taken place 
for a mutation to be replicated.” In contrast. 
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our view is that no exposure to an exog- 
enous agent is required for tumor initiation. 
Replicative mutations can be responsible for 
either initiating the process of tumorigenesis 
or for driving tumor progression. 

Potter and Prentice and Wild 
et al suggest that if we had 
been able to include 
other cancer types, 
particularly com- 
mon cancer types 
such as those of 
the prostate and 
breast, we might 
have concluded 
that less than 
two-thirds of 
the variation in 
cancer risk across 
tissues is ascribable 
to replicative mutations. 

We stated in our Report 
that we could only include 
cancers in which normal stem cells 
had been well-characterized, and agree that 
this was a limitation of our study. However, 
Cancer Research UK estimates that no cases 
of prostate cancer and only 27% of breast 
cancers are preventable (4). Therefore, once 
adequate research on the stem cells in these 
organs is performed, we expect that the 
inclusion of these cancers will not signifi- 
cantly affect the correlation coefficient we 
observed [see (5) for more details]. 

We agree with Potter and Prentice and 
Wild et al. that the evaluation of data from 
other countries in the same way will be 
valuable. However, those data will not affect 
our conclusion that “stochastic effects asso- 
ciated with DNA replication contribute in a 
substantial way to human cancer incidence 
in the United States.” Although replica- 
tive mutations are expected to vary little 
among populations, inherited mutations 
and environmentally based mutations are 
known to vary considerably. For example, 
in a country where everyone smokes and 
is obese, the correlation between stem cell 
divisions and cancer rates will be far lower 
than 0.80 because avoidable factors play a 
greater role. 

Potter and Prentice criticize our multi- 
plication of two logarithms to derive extra 
risk score (ERS). It may seem unintuitive 
to multiply rather than add logarithms, 
but both are valid mathematical operations 
to apply, with different interpretations. A 
detailed explanation of the mathemati- 
cal basis of the ERS was provided in the 
Supplementary Materials and is expanded 
upon in our Technical Report (5). 

Potter and Prentice and Gotay et al. state 
that genetic and environmental influences 
could influence the total number of stem 



cell divisions. We agree, which is why we 
defined replicative mutations to exclude 
such effects. Replicative errors occur at 
rates that can be measured in totally nor- 
mal cells in vitro in the absence of 
any carcinogens. Carcinogens 
and hereditary factors 
add extra mutations 
to the baseline level 
established by the 
unavoidable repli- 
cative mutations. 

Wild et al. state 
that “a major- 
ity of cancers 
are preventable.” 
The Centers for 
Disease Control 
(CDC) estimates that 
21% of cancer deaths 
are potentially prevent- 
able in the United States 
(d). The most recent estimate 
from Cancer Research UK is that 
42% of cancers in the UK are prevent- 
able (4). These two organizations, as well 
as the World Health Organization Wild 
et al. represent, are committed to cancer 
prevention efforts and to identifying and 
implementing strategies to reduce cancer 
risk. Nothing in our study contradicts 
their estimates of potentially preventable 
cancers. To the contrary, our data provide a 
mechanism to help understand the molec- 
ular basis for the CDC’s estimate (as noted 
above, our cancer incidence data were 
derived from a US. population). 

Wild et al. also state that 
“although important for 
the individual, chance 
has little to say about 
the incidence rate in a 
population” and com- 
ment about prediction 
of “incidence rates at any 
given cancer site.” Our 
results specifically demon- 
strate that chance plays an 
important role in the incidence 
rate in a population. Our approach 
explains variation in cancer incidence 
across tissues, rather than providing pre- 
diction at any particular cancer type. 

Wild et al. support their claim that 
“the role of chance... has long been rec- 
ognized” with a reference to the classic 
studies of Armitage and Doll (7). This 
claim illustrates that the role of replica- 
tive mutations in cancer is not adequately 
appreciated, even today. Armitage and 
Doll’s work was directed to understanding 
“carcinogenesis” considering “the ages at 
which the subjects are exposed” to various 
carcinogens. There are no such exposures 



required for replicative mutations. 

Song and Giovannucci state that the 
“attributable fractions for individual fac- 
tors could add up to more than 100%.” 

The potential causes of mutations, and 
therefore cancer, can be partitioned in two 
subsets: factors related to the number of 
stem cell divisions and factors unrelated to 
those divisions. Thus, by assumption, these 
two causes add up to explain exactly 100% 
of the variation in risk. 

We agree with Song and Giovannucci 
that the preventability of specific cancer 
types is more precisely estimated by epi- 
demiologic evaluations than by ERS. The 
ERS provides a rough idea of the potential 
preventability of individual cancer types, 
but only in relation to other cancer types 
rather than in absolute terms (5). At the 
same time, our work provides a way to 
calculate the evidence for such extra risks 
that is free from all assumptions used 
previously. The idea that two-thirds of the 
relative variation in cancer risk can be 
explained (correlation coefficient of 0.80), 
and the relative environmental or heredi- 
tary influences roughly estimated, from a 
single biological feature (number of stem 
cell divisions) is unprecedented. 

With respect to comments of 
O’Callaghan, our study was not intended 
to be a meta-analysis such as that used to 
evaluate clinical interventions. We used 
PubMed to find all the references we could 
and used our judgment to select 146 that 
we considered among the most reli- 
able. However, we did not have 
complete trust in our judg- 
ment, nor complete trust in 
the estimates made in the 
original references. We 
therefore performed rig- 
orous tests for robustness 
of the conclusions based 
on these estimates. For 
example, statistical signifi- 
cance persisted even when 
we allowed the reported 
estimates of stem cell divi- 
sions to vary by ~ 100-fold in either 
direction [see our Report, Supplementary 
Materials]. Few meta-analyses would sur- 
vive robustness tests like these. 
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TECHNICAL COMMENT 
ABSTRACTS 

Comment on “Using ecological 
thresholds to evaluate the costs and 
benefits of set-asides in a biodiversity 
hotspot” 

Christopher Finney 

Banks-Leite et al. (Reports, 29 August 
2014, p. 1041) conclude that a large-scale 
program to restore the Brazilian Atlantic 
Forest using payments for environmental 
services (PES) is economically feasible. 
They do not analyze transaction costs, 
which are quantified infrequently and 
incompletely in the literature. Transaction 
costs can exceed 20% of total project costs 
and should be included in future research. 

Full text at http://dx.doi.org/10.1126/sci- 
ence.aaa0916 

Response to Comment on “Using 
ecological thresholds to evaluate the 
costs and benefits of set-asides in a 
biodiversity hotspot” 

Cristina Banks-Leite, Renata Pardini, Leandro 
R. Tambosi, William D. Pearse, Adriana A. 
Bueno, Roberta T Bruscagin, Thais H. Condez, 
Marianna Dixo, Alexandre T Igari, Alexandre 
C. Martensen, Jean Paul Metzger 

Finney claims that we did not include 
transaction costs while assessing the 
economic costs of a set-aside program in 
Brazil and that accounting for them could 
potentially render large payments for envi- 
ronmental services (PES) projects unfeasible. 
We agree with the need for a better under- 
standing of transaction costs but provide 
evidence that they do not alter the feasibility 
of the set-aside scheme we proposed. 

Full text at http://dx.doi.org/10.1126/sci- 
ence.aaal602 
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RESEARCH 


TECHNICAL RESPONSE 


CONSERVATION ECONOMICS 

Response to Comment on “Using 
ecological thresholds to evaluate the 
costs and benefits of set-asides in a 
biodiversity hotspot” 

Cristina Banks-Leite/’^* Renata Pardini,^ Leandro R. Tambosi,^ William D. Pearse,^ 
Adriana A. Bueno, ^ Roberta T. Bruscagin,^ Thais H. Condez,® Marianna Dixo,^ 
Alexandre T. Igari,’^ Alexandre C. Martensen,® Jean Paul Metzger^ 

Finney claims that we did not include transaction costs while assessing the economic costs 
of a set-aside program in Brazil and that accounting for them could potentially render 
large payments for environmental services (PES) projects unfeasible. We agree with the 
need for a better understanding of transaction costs but provide evidence that they do not 
alter the feasibility of the set-aside scheme we proposed. 


A s Finney points out {!), transaction costs 
are rarely quantified and/or reported in 
the payments for environmental services 
(PES) literature, and this is particularly 
true for the Atlantic Forest. Finney cor- 
rectly suggests that transaction costs were ig- 
nored in Banks-Leite et al. (2) due to a lack of 
existing data and proposes four questions that 
should be answered before we can understand 
the actual feasibility of PES projects. The ques- 
tions posed by Finney should certainly be used 
to guide future research on the feasibility of PES 
at large scales, but their answers require col- 
lecting additional data. In the absence of those 
data, we here focus on whether including tran- 
saction costs would change the main conclusion 
reported in Banks-Leite et al. {2). Specifically, 
we ask whether the set-aside program would 
become prohibitively expensive if transactions 
costs are explicitly accounted for, and discuss ways 
in which transaction costs could be curtailed. 
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Universidade de Sao Paulo, Rua do Matao, 321, Travessa 14, 
05508-090, Sao Paulo, SP, Brazil. ^Department of Ecology, 
Evolution, and Behavior, University of Minnesota, 215 Ecology 
Building, 1987 Upper Buford Circle, Saint Paul, MN 55108, 
USA. ^Fundagao Florestal, Rua do Horto, 931, 02377-000, 

Sao Paulo, SP, Brazil. ^Departamento de Zoologia, Instituto 
de Biociencias, Universidade Estadual Paulista, Caixa Postal 
199, 13506-900, Rio Claro, SP, Brazil. ^Curso de Gestao 
Ambiental, Escola de Artes, Ciencias e Humanidades, 
Universidade de Sao Paulo, Avenida Arlindo Bettio, 1000, 
03828-000, Sao Paulo, SP, Brazil. ^Department of Ecology 
and Evolutionary Biology, University of Toronto, 25 Willcocks 
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^Corresponding author. E-mail: c.banks@imperial.ac.uk 


Finney provides data to show that transac- 
tion costs vary widely and mentions one spe- 
cific case study where transaction costs were 
shown to be comparable to PES costs. This 
suggests that the PES estimates reported by 
Banks-Leite et al. (2) were roughly half of the 
real cost of paying for set-asides in the Atlantic 
Forest. We have reanalyzed the data provided 
in Banks-Leite et al. (2) by doubling our previous 
estimates of US$56.3 million per year for paying 
landowners to set aside 424,000 ha of private land 
for restoration. In our new estimates, US$112.6 
million would be needed to cover PES and tran- 
saction costs, which, added to the active restora- 
tion costs of US$141.3 million, gives a total sum of 
US$253.9 million per year for the biome-wide set- 
aside program. This estimate accounts for 0.0118% 
of Brazil’s GDP (previous calculated as 0.009%) 
and 8.3% of Brazil’s annual expenditure on agri- 
cultural subsidies (previous calculated as 6.5%). 
The new figures are obviously higher but still 
show the feasibility of the set-aside program 
advocated in Banks-Leite et al. (2) and suggest 
that even greater transaction costs would still 
be feasible. 

The data reported by Finney in table 1 (1) 
show that 55% of transaction costs consist of 
general assessment, property mapping, and mon- 
itoring, but we believe that the costs of these 
measures can be reduced. First, the Brazilian 
government is now implementing a new program 
called CAR (Cadastro Ambiental Rural), for which 
it will buy high-resolution satellite images for 
the whole country every year and will restrict 
the endowment of rural credits just to land- 
owners who submit to the national database a 
detailed map of their properties (including native 
vegetation, production areas, legal reserves, and 


potential areas for restoration). This would reduce 
the costs of general assessment and property map- 
ping by improving the database of potential areas 
for restoration. 

Second, remote sensing techniques are often 
underused in pilot projects, such as the ones 
mentioned by Finney, and their efficiency and 
cost-effectiveness dramatically increase with 
scale. For instance, monitoring the recovery of 
biodiversity and ecological processes depends 
on expensive and time-consuming field work. 
However, field monitoring can be replaced by 
remote sensing and appropriate landscape in- 
dicators for a fraction of the costs (3), thus re- 
ducing the need for a more complete and detailed 
field assessment to just a subset of the restored 
sites. 

It is also appropriate to further explore an- 
other advantage of the set-aside program ad- 
vocated by Banks-Leite et al. (2). Although we 
propose the restoration of 424,000 ha of Atlan- 
tic Forest, the initial area can be increased or 
reduced according to an existing budget, and 
more areas can be added to the program once 
active restoration is no longer needed. For in- 
stance, using Banks-Leite’s et al. (2) original 
estimates, let’s consider that only $100 million 
can be committed each year, which halves the 
amount of area that can be set aside for res- 
toration. When active restoration practices are 
no longer needed after 3 years, the overall costs 
of the program would drop to US$28.5 million, 
which means that US$71.5 million would become 
available for restoring a further 153,500 ha of 
priority landscapes. The addition of new areas 
to the program after the active-restoration pe- 
riod can be iterated many times and could po- 
tentially restore up to 750,000 ha within 30 years 
with a limited budget of US$100 million per 
year. If the whole budget of US$200 million is 
available, the effect of this program would be 
much wider and larger and it would deliver out- 
comes much faster. 

In conclusion, although Finney raises a very 
important issue that we indeed had not dealt 
with in Banks-Leite et al. (2), we believe that even 
reasonably high transaction costs would not be 
an impediment to the proposed PES scheme 
and that our approach is still robust and cost- 
efficient. A biome-wide set-aside program is also 
still more realistic than creating protected areas, 
and it can be easily adapted to protecting water- 
sheds or other discontinuous areas. 
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How a food poisoning toxin 
interferes with tight junctions 

Saitoh et al., p. 775 



DENTAL MATERIALS 

Key trace 
minerals greatly 
strengthen teeth 

T he outer layers of teeth 
are made up of nanowires 
of enamel that are prone 
to decay. Gordon eta/, 
analyzed the composition 
of tooth enamel from a variety of 
rodents at the nanometer scale 
(see the Perspective by Politi). In 
regular and pigmented enamel, 
which contain different trace 
elements at varying bound- 
ary regions, two intergranular 
phases— magnesium amorphous 
calcium phosphate or a mixed- 
phase iron oxide— control the 
rates of enamel demineralization. 
This suggests that there may be 
alternative options to fluoridation 
for strengthening teeth against 
decay. — MSL 

Science, this issue p. 746; see also p. 712 



Rodent skull shows 
pigmented and regular 
tooth enamel 


« I 


SUPERCONDUCTIVITY 

Light switch for 
superconductivity 

The conducting properties of 
materials sensitively depend on 
the available carrier concentra- 
tion. Physicists can vary this 
concentration by inducing carri- 
ers in the material: for example, 
by placing it next to an ionic liq- 
uid in an electric field. Suda etal. 
instead used a layer of the mol- 
ecule spiropyran, which changes 
from a non-ionic to an ionic form 
when it is irradiated by ultraviolet 
(UV) light. The authors placed 
the layer on top of a thin crystal 


of an organic material. When they 
shone UV light on the spiropyran, 
the adjacent material became 
superconducting, thanks to the 
carriers induced at the interface. 
-JS 

Science, this issue p. 743 


MAMMALIAN EVOLUTION 

Taking advantage of new 
neighborhoods 

Mammals are one of the most 
morphologically diverse organ- 
isms, with adaptation to unique 
ecological conditions creating a 
wide array of forms, from mice to 


whales. Two new basal mammals 
from the mid-Jurassic period 
suggest that this diversification 
was well under way millions of 
years earlier than previously 
thought. Luo etal. describe a bur- 
rowing species with limb and digit 
modifications similar to those of 
current burrowers and identify 
the likely genes and developmen- 
tal pathways involved. Meng et 
al. describe an arboreal species 
with modifications for climbing 
and that possessed teeth clearly 
adapted for a herbivorous diet, 
including the consumption of 
sap.-SNV 

Science, this issue p. 760, p. 764 


MARINE POLLUTION 

Dumping lots of plastics 
into our oceans 

Considerable progress has been 
made in determining the amount 
and location of plastic debris in 
our seas, but how much plastic 
actually enters them in the 
first place is more uncertain. 
Jambeck etal. combine available 
data on solid waste with a model 
that uses population density and 
economic status to estimate the 
amount of land-based plastic 
waste entering the ocean. Unless 
waste management practices 
are improved, the flux of plastics 
to the oceans could increase by 
an order of magnitude within the 
next decade. — HJS 

Science, this issue p. 768 


CANCER 

A PET approach to 
imaging brain tumors 

Cancers are readily visible with 
positron emission tomography, 
or PET, an imaging method in 
which a radioactively labeled 
tracer accumulates in the tumor. 
Labeled glucose is often used 
as a tracer because tumor cells 
tend to require large amounts 
of glucose. PET cannot be used 
to image brain tumors, however, 
because normal brain cells 
are also highly dependent on 
glucose. Now Venneti et al. show 
that radiolabeled glutamine, 
which is also taken up by tumor 
cells, yields clear images of brain 
tumors in mice and humans. 

With the glutamine label, cancer 
cells can be distinguished from 
normal brain cells and even 
from tumors that are no longer 
growing. — YN 

Sci. Trans! . Med. 7 , 274ral7 (2015). 
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HUMORAL IMMUNITY 

For a single B cell, many 
roads to take 

To successfully fight a pathogen, 
immunological B cells must 
wage a multipronged attack: 
They can differentiate into anti- 
body-secreting plasma cells or 
become T cell-helping germinal 
center cells, or even long-lived 
memory cells. But can a single B 
cell acquire all of these different 
fates? To find out, Taylor et al. 
tracked the responses of single 
B cells in mice. Although some B 
cells acquired only one fate, oth- 
ers differentiated into all three. 
The authors linked the ability of 
B cells to differentiate into mul- 
tiple subsets to their ability to 
proliferate and resist cell death, 
and the affinity of their antigen 
receptor. — KLM 

Sc/ence, this issue p. 784 


NEURAL CIRCUITS 

Taking a snapshot of 
active brain circuitry 

Neuroscientists now have a 
method to mark active popula- 
tions of neurons in vivo to study 
circuit activity in the behaving 
animal. Fosque etal. designed 
and thoroughly validated a fluo- 
rescent protein-based reagent 
that allows permanent marking 
of active cells over short time 
scales. This indicator, termed 
CaMPARI, switches from its 



Active neuronal circuits 
revealed in zebrafish 


native green to a red fluores- 
cent state by simultaneous 
illumination with violet light and 
exposure to increased levels of 
intracellular calcium. CaMPARI 
successfully marked active 
nerve cells in Drosophila, zebra- 
fish, and mouse brains. — PRS 

Science, this issue p. 755 


VIRAL REPLICATION 

A view of the HCV 
poiymeraseatwork 

More than 3% of the world’s 
population is infected with 
hepatitis C virus (HCV), a predis- 
posing factor for life-threatening 
liver diseases such as cirrhosis 
and cancer. HCV encodes a 
polymerase called NS5B that 
catalyzes replication of the viral 
RNA genome. Drugs inhibiting 
NS5B have shown impressive 
antiviral activity in recent clinical 
trials. Appleby etal. (see the 
Perspective by Bressanelli) 
reveal the inner workings of HCV 
RNA replication by analyzing 
crystal structures of stalled 
NS5B polymerase ternary 
complexes during the initiation 
and elongation of RNA synthe- 
sis. They also define the way in 
which sofosbuvir, a drug with 
potent clinical efficacy, interacts 
with the NS5B active site. — PAK 

Science, this issue p. 771; 

see also p. 715 


SENSORY PERCEPTION 

How capsaicin channels 
pain reiief 

Ion channels are essential in 
sensing pain and pressure. The 
chemical in chilies that makes 
them hot is capsaicin, which 
activates the calcium-permeable 
channel TRPVl. Seemingly 
paradoxically, this chemical is 
also used as a topical analgesic. 
Borbiro et al. found that activa- 
tion of TRPVl by capsaicin 
inhibited the mechanosensitive 
Piezo channels by depleting 
specific phosphoinositides in 
the plasma membrane (see the 
Focus by Altier). These results 
may explain some of the analge- 
sic effects of capsaicin. — NRG 
Sci. Signal. 8 , ral5 and fs3 (2015). 
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Edited hy Kristen Mueller 
and Jesse Smith 



PLANT SCIENCE 

HIV antivirals from engineered soybeans 

A ffordable antivirals used as vaginal microbicides could 
have a substantial impact on the HIV epidemic, particu- 
larly in the developing world. One potential candidate is 
cyanovirin-N, a protein produced by a cyanobacterium 
that prevents viral entry in preclinical studies. Large- 
scale production of cyanovirin-N, however, is prohibitively 
expensive. To get around this, O’Keefe et al. genetically 
engineered soybean seeds to make cyanovirin-N. The seeds 
produced large quantities of the antiviral and it survived 
the normal industrial processing systems already in place 
for soybeans. By rough estimate, one greenhouse growing 
engineered soybeans could provide enough cyanovirin-N to 
protect a woman for 90 years. — PJH 

Plant Biotech. J. 10.1111/pbi.l2309 (2015). 


NEUROSCIENCE 

Brain activity predicts 
later recollection 

When people remember details 
about a specific event (a process 
referred to as “recollection”), a 
part of their brain called the core 


recollection network is highly 
active. To better understand the 
underlying connectivity of dif- 
ferent brain regions involved in 
recollection. King et al. scanned 
the brains of people while they 
were retrieving earlier memo- 
ries. Memory retrieval led to 
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ENVIRONMENTAL SCIENCE 

Changing tastes in marine 
microbe food webs 

Protists are single-celled organ- 
isms complete with nuclei, 
organelles, and symbionts, 
and possess a multiplicity of 
physiological talents. They are 
ubiquitous, abundant, and often 
neglected by science. Worden 
et al. review the challenges of 
understanding the role protists 
play in geochemical cycling in 
the oceans. These organisms 
can photosynthesize like plants, 
graze on bacteria and archaea, 
parasitize each other and bigger 
creatures, have sex, and some- 
times do all these things serially 
as conditions change. Their 
activities may have a significant 
influence on carbon cycling, 
and research efforts need to be 
amplified to understand their 
functional importance in marine 
ecosystems. — CA 

Science, this issue p. 735 

SUSTAINABILITY 

Crossing the boundaries 
in giobai sustainabiiity 

The planetary boundary (PB) 
concept, introduced in 2009, 
aimed to define the environmen- 
tal limits within which humanity 
can safely operate. This approach 
has proved influential in global 
sustainability policy development. 
Steffen etal. provide an updated 
and extended analysis of the 
PB framework. Of the original 
nine proposed boundaries, they 
identify three (including climate 
change) that might push the 
Earth system into a new state if 
crossed and that also have a per- 
vasive influence on the remaining 
boundaries. They also develop 
the PB framework so that it can 
be applied usefully in a regional 
context. — AMS 

Sc/ence, this issue p. 736 


ORGANIC CHEMISTRY 

Optimizing a catalyst 
many ways at once 

Optimization strategies are 
often likened to hikes in a hilly 
landscape. If your goal is to get 
to the top of the highest hill, and 
you only take steps toward higher 
ground, you might never find a 
peak on a route that requires 
a preliminary descent. So it is 
in chemistry, where optimiz- 
ing each structural feature of 
a catalyst consecutively might 
gloss over subtle tradeoffs that 
in combination offer the best 
performance. Milo etal. use 
multidimensional analysis tech- 
niques to generate a predictive 
model of how selectivity depends 
on multiple characteristics of the 
catalyst and substrate in a C-N 
bond-forming reaction (see the 
Perspective by Lu). They then 
apply this model to improve the 
catalyst globally. — JSY 

Science, this issue p. 737; 

see also p. 719 


ATMOSPHERIC CHEMISTRY 

Double trouble for 
Criegee intermediates 

Atmospheric ozone reacts with 
unsaturated hydrocarbons to 
form unstable compounds called 
Criegee intermediates. Only 
recently did a reliable method 
emerge to make and study these 
compounds in the laboratory, 
and they didn’t seem to react 
with water. Now, Chao etal. 
show that the simplest Criegee 
intermediate, CH 2 OO, does 
in fact react very rapidly with 
pairs of water molecules (see 
the Perspective by Okumura). 
Earlier studies may have missed 
this result because they were 
conducted at low pressure, 
where water dimers are scarce, 
or because they monitored 
downstream processes. The 


new rate measurements imply 
that reactivity with atmospheric 
water pairs is a major decay 
channel for Criegees. — JSY 

Science, this issue p. 751; 

see also p. 718 


TIGHT JUNCTIONS 

How a toxin makes 
epitbelial sheets leaky 

The entire human body and 
its many compartments are 
shielded from their external 
environments by the barrier 
function of epithelial cell sheets. 
The paracellular barrier func- 
tion of tight junctions (TJs) is 
critical for maintaining homeo- 
stasis in any multicellular 
organism, especially in the skin 
and internal organs and at the 
blood-brain barrier. One of the 
major components of TJs is a 
family of adhesive membrane 
proteins known as claudins. 
Several members of the claudin 
family are receptors for the 
bacterial toxin Clostridium 
perfringens enterotoxin. This 
toxin often causes food-borne 
illness both in humans and 
animals. Saitoh etal. crystallized 
a complex between the toxin 
and a claudin that reveals just 
how the toxin damages epithelial 
barriers (see the Perspective by 
Artursson and Knight). — SMH 
Science, this issue p. 775; 

see also p. 716 

CHEMICAL BIOLOGY 

Toward drugging the 
undruggable in cancer 

Many human cancers are 
characterized by inappropriate 
activity of transcription factors. 
These proteins are attractive drug 
targets in principle, but normal- 
izing their function requires 
drugs that modulate specific 
protein-protein interactions. 


a goal that has been challeng- 
ing. In acute myeloid leukemia, 
a chromosomal translocation 
creates an aberrant form of the 
transcription factor CBE-beta, 
which outcompetes “normal” 
CBE-beta for binding to another 
transcription factor called RUNXl, 
thereby deregulating its activity. 
Illendula etal. identified and 
chemically optimized a small 
molecule that selectively disrupts 
the interaction between the aber- 
rant CBE-beta and RUNXl (see 
the Perspective by Koehler and 
Chen). This molecule restored 
normal gene expression patterns 
and delayed leukemia progres- 
sion in mice. Thus, transcription 
factors may not be as undrug- 
gable as once thought. — PAK 

Science, this issue p. 779; 

see also p. 713 


CENTROMERES 

The a-satellite arrays of 
human centromeres 

Centromeres link sister 
chromatids and are central 
to chromosome segregation. 
Centromeres are found 
within the tandemly repetitive 
a-satellite DNA sequences. It 
has been difficult to define the 
molecular processes involved in 
the formation of centromeres. 
Henikoff eta/, identified two 
sets of human a-satellite dimer 
sequences that are consistently 
associated with particular 
centromere-associated nucleo- 
somes— the proteinaceous 
structures around which 
chromosomal DNA is wrapped. 
This finding may help to identify 
other tandem repeat arrays 
within the centromeres of other 
eukaryotes and should allow for 
a better molecular understand- 
ing of centromere evolution and 
function. — ASH 

Science Advances 10.1126/ 
sciadv.00234(2015). 
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Ingested algae pitch in 
to prevent starvation 



SYMBIOSIS 


Plastid thieves escape starvation 


G reen sea slugs feed on algae, and in the process they take up photosynthetic plastids from 
the algae. The plastids stay intact in the slug’s gut cells for months, and the slugs benefit 
nutritionally from ongoing photosynthesis. So much so that that some scientists think that 
the plastids can prevent slugs from dying of starvation. But photosynthesis and starvation 
can also produce toxic products that cause the slugs to die. de Vries etal. found one species 
of slug that does not rely on photosynthesis to keep it going during hard times. This slug resists 
death by ramping up mechanisms to remove damaged plastids and toxins. — CA 


Proc. R. Soc. B 10.1098/rspb.2014.2519 (2015). 


increased activity in specific 
areas of their brains, both in the 
core recollection network and in 
other regions. By analyzing the 
active areas in different people, 
areas of functional connectivity 
emerged. More accurate recol- 
lections correlated with greater 
connectivity among these differ- 
ent regions. — PRS 

J. Neurosci. 35,1763(2015). 


BIOMATERIALS 

Better bone patching with 
hydrogel foam 

Bone grafts are currently the clin- 
ical standard for replacing larger 
bone defects caused by trauma, 
infections, or tumors. Degradable 
biomaterials are not sufficient for 
regenerating bone without the 
addition of peptides or growth 
factors to attract and direct the 
needed cells. Neffe etal. devised 
a one-step process for creating 



Pore structure of a 3D architectured 
hydrogel (ArcGel) 


hydrogels through a foaming 
process using gelatin and lysine 
connected by urea junctions. 

The gels form an open porous 
structure that allows cells to 
invade and provides support for 
their adhesion, as well as offering 
tailorable local environments 
and mechanical properties and 
controlled and rapid degradation. 
In vivo testing for the regenera- 
tion of 5-mm femoral defects 
in rats showed healing capacity 
comparable to that of bone grafts 
after 6 weeks. — MSL 

Adv. Mater. 10.1002/ 
adma.201404787(2015). 


BIOMECHANICS 

Measuring the forces that 
shape tissues 

During development, intracellular 
molecular motors generate forces 
that cause cells to change shape 
and remodel their contacts with 
other cells. But exactly how these 
forces change cell shape is largely 
unknown. Bambardekareta/. 
used light-sheet microscopy and 
optical tweezers to image a devel- 
oping fruit fly embryo and directly 
measure the tension at cell-cell 
junctions. As morphogenesis 
progressed, the tension increased 
and became anisotropic (direc- 
tional) across the embryo. On 


the basis of these measurements 
and by monitoring the effect of 
cell deformations on neighboring 
cells, the authors then created 
a model that predicts how local 
deformations propagate through 
the tissue. — VV 

Proc. Natl. Acad. Sci. U.S. A. 10.1073/ 
pnas.l418732112(2015). 


ENVIRONMENTAL SCIENCE 

Energy and wastewater 
treatment, too 

Treating wastewater removes 
organic matter that would other- 
wise harm the environment when 
released back into waterways. 
New bioelectrochemical systems 
provide the added benefit of 
using this organic matter as 
a fuel for bacteria to generate 
hydrogen gas or electricity during 
wastewater treatment. However, 
concerns remain about whether 
the other functions of wastewa- 
ter treatment, including reducing 
the amount of trace organic com- 
pounds released back into the 
environment, will be lost. Werner 
etal. tested the breakdown of low 
levels of pharmaceuticals, pesti- 
cides, and antibiotics in microbial 
fuel cells and microbial electroly- 
sis cells. Although some of these 
contaminants persisted after 
passing through the microbial 
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reactors, the breakdown effi- 
ciency was similar to that of most 
modern wastewater treatment 
methods. — NW 

Water Res. 10.1016/ 
j.watres.2015.01.013 (2015). 

OPTICS 

A temperature probe for 
the ultracold 

The temperature of an everyday 
macroscopic object is well under- 
stood and easy to measure. The 
concept of temperature, however, 
becomes blurry when the number 
of particles in a system becomes 
so small that the ensemble 
breaks down into a number of 
individual components, such as 
for atoms or molecules trapped 
in an optical lattice and cooled to 
ultralow temperatures. McDonald 
etal. demonstrate a spectro- 
scopic technique that uses the 
line shift and change in the line 
shape of an atomic or molecu- 
lar transition to determine the 
temperature of trapped atoms 
or molecules down to nanokelvin 
temperatures. This technique 
should prove useful in a range of 
applications, including metrol- 
ogy and probing the energetic 
dynamics of ultracold chemical 
reactions. — ISO 

Rhys. Rev. Lett. 114, 23001 (2015). 


CANCER RESEARCH 

Lifetime risk of cancer 
goes up 

More than 50% of people born in 
the U.K. since 1960 will develop 
cancer in their lifetimes, finds 
a new study. Combining actual 
cancer rates from 1951 to 2012 
with projected rates for 2013 
through 2060, Ahmad etal. esti- 
mated lifetime risks for men and 
women born in 1930 with those 
for men and women born from 
1931 to 1960. For men, the risk 
increased from 38.5% for those 
born in 1930 to 53.5% for those 
born in I960: for women, risk 
increased from 36.7% to 47.5%. 
Increasing life expectancy was 
the primary reason for the risk 
increase. — CG 

Br J. Cancer, 10.1038/ 
bjc.2014.606(2014). 
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ENVIRONMENTAL SCIENCE 

Ret hinkin g the marine carbon cycle: 
Factoring in the multifarious 
lifestyles of microbes 


Alexandra Z. Worden,* * Michael J. Follows. 
Amy E. Zimmerman, Patrick J. Keeling 

BACKGROUND: Marine ecosystems are com- 
posed of a diverse array of life forms, the 
majority of which are unicellular— archaea, 
bacteria, and eukaryotes. The power of these 
microbes to process carbon, shape Earth’s at- 
mosphere, and fuel marine food webs has been 
established over the past 40 years. The marine 
biosphere is responsible for approximately half 
of global primary production, rivaling that of 
land plants. Unicellular eukaryotes (protists) 
are major contributors to this ocean produc- 
tivity. In addition to photosynthetic growth, 
protists exhibit a range of other trophic modes, 
including predation, mixotrophy (a combina- 


Stephen J. Giovannoni, Susanne Wilken, 


tion of photosynthetic and predatory-based 
nutrition), parasitism, symbiosis, osmotrophy, 
and saprotrophy (wherein extracellular en- 
zymes break down organic matter to smaller 
compounds that are then transported into the 
cell by osmotrophy). 

ADVANCES: Sensitive field approaches have 
illuminated the enormous diversity of protis- 
tan life (much of it uncultured) and, coupled 
with activity measurements, are leading to 
hypotheses about their ecological roles. In par- 
allel, large-scale sequencing projects are provid- 
ing fundamental advances in knowledge of 


genome/gene composition, especially among 
photosynthetic lineages, many of which are 
complex amalgams derived from multiple 
endosymbiotic mergers. Marine protists have 
yielded insight into basic biology, evolution, 
and molecular machine- 
ries that control oi^anismal 
responses to the environ- 
ment. These studies reveal 
tightly controlled signal- 
ing and transcriptional 
regulation as well as re- 
sponses to limitation of resources such as iron, 
nitrogen, and vitamins, and offer understand- 
ing of animal and plant evolution. With the 
formulation of better computational approaches, 
hypotheses about interactions and trophic ex- 
changes are becoming more exact and mod- 
elers more assertive at integrating different 
data types. At the same time, the impacts of 
climate change are being reported in multiple 
systems, of which polar environments are the 
touchstone of change. 

OUTLOOK: Driven by the need to translate the 
biology of cells into processes at global scales, 
researchers must bring the conceptual frame- 
work of systems biology into bigger “ecosys- 
tems biology” models that broadly capture the 
geochemical activities of interacting plankton 
networks. Existing data show that protists are 
major components of marine food webs, but 
deducing and quantifying their ecosystem link- 
ages and the resulting influences on carbon 
cycling is difficult. Genome-based functional 
predictions are complicated by the importance 
of cellular structures and flexible behaviors 
in protists, which are inherently more difficult 
to infer than the biochemical pathways typ- 
ically studied in prokaryotes. Alongside the 
plethora of genes of unknown function, manipu- 
lable genetic systems are rare for marine pro- 
tists. The development of genetic systems and 
gene editing for diverse, ecologically impor- 
tant lineages, as well as innovative tools for 
preserving microbe-microbe interactions dur- 
ing sampling, for visual observation, and for 
quantifying biogeochemical transformations, 
are critical but attainable goals. These must be 
implemented in both field work and laboratory 
physiology studies that examine multiple environ- 
mental factors. Expanding genome functional 
predictions to identify the molecular under- 
pinnings of protistan trophic modes and realis- 
tically constrain metabolism will position the 
field to build reliable cell systems biology mod- 
els and link these to field studies. By factoring 
in true complexities, we can capture key elements 
of protistan interactions for assimilation into 
more predictive global carbon cycle models. ■ 

The list of author affiliations is available in the full article online. 
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Global biogeochemical and ecological models rely on understanding organismal biology and 
the interactions occurring in marine microbial food webs. Protists have multifarious roles from 
the sunlit surface ocean to leagues below. Understanding of protistan behaviors and adaptability lags 
far behind knowledge of evolutionary processes that have shaped their genomes. As such, microbial 
mediation of carbon fluxes and specific interactions remain ill-resolved and predictive capabilities are 
still weak. Strategies to narrow this gap involve iteration between experimental and observational 
field studies, controlled laboratory experiments, systems biology approaches that preserve cellular 
structures and behaviors using relevant model taxa, and computational approaches. 
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Ret hinkin g the marine carbon cycle: 
Factoring in the multifarious 
lifestyles of microbes 

Alexandra Z. Worden/’^’^* Michael J. Follows,^ Stephen J. Giovannoni,^ 

Susanne Wilken,^ Amy E. Zimmerman/ Patrick J. Keeling^’® 

The profound influence of marine plankton on the global carbon cycle has been recognized 
for decades, particularly for photosynthetic microbes that form the base of ocean 
food chains. However, a comprehensive model of the carbon cycle is challenged by 
unicellular eukaryotes (protists) having evolved complex behavioral strategies and 
organismal interactions that extend far beyond photosynthetic lifestyles. As is also true 
for multicellular eukaryotes, these strategies and their associated physiological changes 
are difficult to deduce from genome sequences or gene repertoires— a problem 
compounded by numerous unknown function proteins. Here, we explore protistan trophic 
modes in marine food webs and broader biogeochemical influences. We also evaluate 
approaches that could resolve their activities, link them to biotic and abiotic factors, 
and integrate them into an ecosystems biology framework. 


T he marine carbon cycle plays a critical role 
in Earth’s habitability for humans and oth- 
er large fauna. At the core of the cycle are 
interconversions of inorganic and organic 
carbon forms, which lead to major ecosys- 
tem services. Protistan (unicellular eukaryotic) 
phytoplankton have long been recognized as 
foundational to fisheries and export of atmospher- 
ic CO 2 to the deep ocean {1-3). Protists convert 
CO 2 to organic carbon via photosynthesis, simul- 
taneously altering cycles of other elements linked 
to carbon by the stoichiometry of cellular compo- 
sition. Thus, the carbon cycle interacts with bio- 
geochemical cycles of nitrogen, silica, and many 
other elements {4-6). How air-sea exchanges of 
CO 2 , primary productivity, and carbon seques- 
tration to the deep sea will be altered as climate 
change affects these and other microbes (7) is 
poorly understood. Here, we review current under- 
standing of the diverse functionality and lifestyles 
of marine protists and how they participate in 
the carbon cycle. 

During the 1970s and 1980s, ideas emerged 
about microbial connectivity in ocean food webs 
{8) and the microbial loop (9), which formalized 
the importance of bacteria and archaea as hubs 
of metabolic diversity. The classical diatom- 
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copepod-fish food chain view of ocean productiv- 
ity was thus revolutionized by the understanding 
that diverse forms of dissolved organic matter 
(DOM) derived from algae were a major energy 
source for heterotrophic bacterial growth (Fig. 1). 
In turn, predatory protists consumed the bacte- 
ria and were themselves preyed upon by larger 
zooplankton. In this framework, algal primary 
production had multiple routes to higher troph- 
ic levels. The microbial loop was the most ineffi- 
cient route in terms of respiratory carbon losses 
(conversion of organic carbon to CO 2 ) because 
as algal DOM and particulate organic matter 
(POM) pass through heterotrophic bacteria and 
archaea, respiration losses occur alongside im- 
portant nutrient remineralization. Viral lysis of 
bacteria was subsequently recognized as another 
potentially substantial source of DOM {10). Protists 
are now often represented as either photoauto- 
trophs (typically diatoms and coccolithophores) 
or heterotrophic predators (Fig. 1). 

Global-scale simulations of ocean ecosystems 
and biogeochemical cycles, overlain on circulation 
models, have become a common tool for carbon 
cycle and climate sensitivity studies, including 
Intergovernmental Panel on Qimate Change as- 
sessments. Such models focus on simulating bio- 
geochemical cycles rather than the organisms that 
mediate them, and build on the seminal studies 
of Riley and colleagues {11). Plankton populations 
are described by partial differential equations that 
represent physical transport, growth, death, and 
interactions at broad levels. Today, such models 
resolve a few broad “functional types” of photo- 
autotrophic microbes (e.g., all small phytoplank- 
ton described by a single set of parameters) and 
two predator groups: protistan and metazoan 
consumers of algae, or “grazers” {12-14). If het- 


erotrophic bacteria are included, it is as one 
homogeneous population. Population growth 
rates are described using highly idealized rela- 
tionships to external resources (75) or internal 
stores {16), without representation of the highly 
flexible and adaptive physiology of the organisms 
concerned. In part, the current parameterization 
of microbial physiology in models reflects com- 
putational limitations, but it also reflects a lag in 
assimilation of new data and understanding of 
microbial cell biology. Determining the level of 
granularity required to accurately simulate and 
interpret ocean ecosystems, the type of baseline 
information that would be adequate to assess 
change, and how to integrate organismal diver- 
sity, dynamics, and interactions into large-scale 
models are not easy tasks. 

As reviewed here, protistan biology comprises 
diverse lifestyles that shape the carbon cycle 
through elaborate but poorly appreciated food 
web connections. A repertoire of phagotrophic 
modes and a penchant for symbioses have led to 
the evolution of cell architectures that can be orders 
of magnitude larger and more complex than those 
of prokaryotes (77). Size matters, in part because 
larger cells (such as the largest coccolithophores) 
sink faster, altering export of carbon and other 
elements to the deep ocean on massive scales 
{17-20). Marine microbiologists and modelers alike 
are becoming acutely aware that phenotypic varia- 
tion in protistan physiology and behavior is crit- 
ical to assessing their broader ecosystem roles 
and future ocean productivity. 

Complex factors govern protistan 
primary production: Phytoplankton 

Marine algae account for about 50 Pg C year"^ of 
primary production, rivaling that of terrestrial 
plants (7). In addition to the roles played by larger 
photosynthetic protists, understanding of the im- 
portance of picoplanktonic taxa (diameter <2 pm) 
is growing {21-23). In the 1950s, the picoeukaryote 
Micromonas pusilla was shown to dominate in 
the English Channel {23), and subsequent discov- 
eries of the abundant (non-eukaryotic) cyanobacte- 
ria Synechococcus {24) and Prochlorococcus {2S) 
confirmed the importance of very small phytoplank- 
ton to marine systems. However, while larger al- 
gae such as diatoms have clear food chain roles 
and fast, quantifiable sinking rates that result in 
carbon export to the deep sea, similar knowledge 
is limited for eukaiyotic and prokaiyotic picoplank- 
ton {20, 26). Moreover, photosynthetic protists are 
extremely diverse; many are difficult to quantify 
in nature, many remain uncultured, and we lack 
baseline information on wild populations. “New” 
algal lineages are still being discovered {27, 28). 
Additionally, picoplanktonic eukaiyotes, including 
uncultured groups, are unexpectedly important 
contributors to CO 2 fixation in environments where 
cyanobacteria dominate numerically {29-32). 

Eukaryotic phytoplankton have evolved from 
heterotrophic ancestors on multiple independent 
occasions, resulting in marked differences in ge- 
nome content and functional capabilities, from 
which they continue to diversify (Box 1 and Fig. 2). 
Comparing the cell wall composition of four 
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common algal lineages illustrates this diversity 
and its influence on ocean biogeochemistry: Some 
stramenopiles have silicate frustules of nanoscale 
precision (diatoms), whereas others are naked; 
haptophytes can be covered in ornate calcium 
carbonate plates (coccolithophores), whereas many 
picoplanktonic species are naked or have organic 
scales; dinoflagellates can possess intracellular 
cellulose plates or be naked; and among the 
prasinophytes, a few species are naked but most 
are enveloped by organic scales (4, 33, 34). Con- 
sequently, the growth of these algae depends on 
different elements’ input sources, remineraliza- 
tion rates, and crystal structures (e.g., calcium 
carbonate in the form of aragonite or calcite) and 
will have correspondingly distinct responses to 
decreasing pH caused by climate change (35). 

Analysis of algae in the context of genomic 
information is advancing understanding of 
their physiology. These studies often focus on 
diatoms or prasinophytes, underscoring highly 
differentiated responses to iron and nitrogen 
availability, tightly controlled gene regulatory 


programs, and signaling systems conserved 
with land plants [e.g., {36-41)']. Genomes are 
available for other marine phytoplankton as 
well (42-44). Complex relationships between re- 
source availability and adaptive strategies are 
also being identified using genomes from cultured 
strains as contextual information and innovative 
approaches to unraveling interactive effects in 
the field (45, 46). For example, diatoms in a Pa- 
cific Ocean study were highly responsive to iron 
fertilization, but metatranscriptome analyses 
suggested continued dependence on iron-free pho- 
tosynthetic proteins rather than a switch to iron- 
containing functional equivalents present in their 
gene repertoire (45). This is hypothesized to al- 
low newly available iron to be used for resource 
acquisition (rather than for photosynthetic ma- 
chinery), contributing to frequent diatom success 
under iron fertilization. These findings would be 
difficult to deduce in laboratory experiments, be- 
cause they would not capture the sharp contrasts 
between diatom responses and those of the broad- 
er algal community. 


Nonetheless, many aspects of algal physiology 
remain unclear. This knowledge gap is be- 
coming critical as climate change influences ma- 
rine food webs and carbon sequestration (47-49). 
Perturbations typically influence multiple envi- 
ronmental parameters simultaneously, and algal 
responses appear to be highly regionalized. For 
example, in the Canadian Arctic, larger photo- 
synthetic protists such as diatoms are reportedly 
being replaced by photosynthetic picoeukaryotes 
(50), and similar shifts are expected in other re- 
gions as a result of stratification-related causes, 
altering both the food web and carbon export. 
In contrast, massive amounts of algal carbon, 
primarily from diatoms, are sinking to the east- 
ern central Arctic sea floor (51). The interactive 
effects of perturbations are extensive [e.g., climate- 
influenced CO 2 , pH, and temperature changes; re- 
viewed in (52)], but most laboratory experiments 
fail to capture their complexity. Cell systems 
biology experiments involving the controlled 
study of model organisms in culture are essen- 
tial for designing field studies that gauge the 
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Fig. 1. A common depiction of marine microbial food webs. Dissolved 
and particulate organic matter (DOM and POM, respectively) from phy- 
toplankton form a basic resource for bacteria and archaea, which respire 
OO 2 (via the microbial loop, purple arrow), thereby modifying the flow of 
carbon expected in earlier diatom-copepod-fish food chains (diatoms are 
algae within the stramenopile lineage, and copepods are multicellular zoo- 
plankton in the opisthokont lineage). Most such schemas do not differen- 
tiate phytoplankton in different size fractions, although size influences food 


web linkages. Inorganic nutrients are also important factors shaping com- 
munity composition. By necessity, processes in the euphotic zone, where 
photosynthesis occurs, are differentiated from those in the aphotic zone, 
where sunlight is unavailable. Anthropogenic impacts (not depicted) include 
acidification, coastal eutrophication, and changes to water column structure 
induced by warming, such as stronger stratification and reduced nutrient 
flux into surface waters, as well as increased exposure to high light inten- 
sities (7). [Adapted from (J05)] 
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impacts of individual taxa on higher ecosystem- 
level processes. 

Carbon in the balance: Predation, 
osmotrophy, and parasitism 

Carbon cycling and sequestration depend on car- 
bon oxidation rates as much as on photosynthesis. 
This balance pivots on the wide-ranging nutri- 
tional strategies of heterotrophic organisms (Fig. 3). 
The best-studied heterotrophic mode of marine 
protists is predation. On average, micro- and nano- 
zooplankton (<200 pm) consume 62% of daily 
algal production, albeit with high regional and 
temporal variability (53). These predators are large- 
ly protistan, but grazing measurements generally 
reflect bulk rates vathout group-specific informa- 
tion. This is problematic because the protists in- 
volved are diverse. In productive environments, 
dinofiagellates can be major grazers of diatom 
blooms, whereas in more oligotrophic regions, 
ciliates and diverse flagellates are important 
consumers of picophytoplankton and bacteria 
{32, 54-56). Structurally complex choanoflagellates, 
amoebozoans, dinofiagellates, and rhizarians can 
also be regionally important predators {57-59). 
Members of the latter group, which includes Acan- 
tharia (part of what used to be called “Radiolaria”) 
and Foraminifera, also feed extensively on hetero- 


trophs, including on multicellular zooplankton such 
as copepods {58). The identity of the taxa respon- 
sible for most predation is not known, particu- 
larly for heterotrophic flagellates. Novel marine 
stramenopiles (MASTs) are presumed to dominate 
predatory flagellates {21), but most MASTs are 
uncultured {60-62) and correct assignment of troph- 
ic modes is complicated because the stramenopiles 
include algae, saprotrophs, predators, and mixo- 
trophs (Box 2). Some MASTs clearly consume bacteria 
and picophytoplankton (55, 56), but single-cell 
and colony isolation studies indicate that more 
complex associations also exist. For example, the 
MAST Solenicola setigera grows on frustules of 
the 6\ditom Leptoqjlindriis mediterraneus, some- 
times alongside Synechococcus, upon which it may 
also feed {60). These discoveries show the difficulties 
in discerning the nature of associations in the wild. 

The huge variation in protistan predation 
strategies is not represented in current marine 
biogeochemical and ecological models because 
the bases for prey selection, feeding rates, and 
alternate strategies are not knovm. Generally 
predators are larger than their prey; however, 
protists can ingest prey of equal or larger size 
than themselves. Examples include the haptophyte 
Prymmsiumparvum {63) and the dinofiagellate 
Karlodinium armiger, which immobilizes cope- 


pods that are 50 times its size before consuming 
them {64). As a consequence, organism size does 
not necessarily increase v^th trophic level, allow- 
ing for a longer food chain and comparatively 
greater carbon losses. Although cell size may play 
a role in prey selection, so too does prey quality, 
relative abundance, and extracellular character- 
istics {21, 65-67), which thereby influence the 
flow of carbon through specific populations. 
Assimilation efficiencies vary among predators and 
prey types, as well as with prey quality and quan- 
tity. Growth and carbon assimilation by the pred- 
atory stramenopile Picophagus flageUatm (<2 pm) 
is very different when fed Prochlorococciis versus 
Synechococcus, resulting in differing carbon flows 
between trophic compartments (55). Addition- 
ally, trophic cascades are strengthened at higher 
temperatures, including grazing control of primary 
producers, but in a nutrient-dependent fashion 
{68, 69). The metabolic processes underlying het- 
erotrophy appear to respond more strongly to 
temperature than does primary production, so 
that increased temperature results in a shift to 
more heterotrophic ecosystem metabolism {70). 

Predatory activities also contribute to pools of 
DOM and POM. The marine DOM carbon pool is 
nearly equal to atmospheric CO 2 and about 200 
times that in living marine biomass {71). Both 
DOM and POM are complex amalgams spanning 
a gradient of sizes and levels of recalcitrance. The 
operational definition of DOM— material passing 
through a 0.2-pm filter— means that it ranges from 
small molecules that can be transported directly 
into cells (osmotrophy) to large macromolecules 
or colloids that may require extracellular digestion 
(saprotrophy) prior to osmotrophic utilization. 
Labile forms of organic matter are quickly respired 
or assimilated by bacteria and protists into living 
POM {72, 73). Thus, labile DOM accounts for 
<0.2 Pg C of the 662 Pg C of measurable marine 
DOM in a recent global survey, and most of the 
organic carbon pool is resistant to oxidation on 
time scales from months to millennia {71). 

The diversity of marine organic carbon com- 
pounds necessitates an elaborate network of het- 
erotrophic strategies to decompose it. Most ocean 
models assume that DOM and POM are oxidized 
by a homogeneous class of prokaryotic hetero- 
trophs that live largely by chemistry transacted 
at the cell surface via enzymes that modify, trans- 
port, and/or remineralize organic molecules (i.e., 
chemoheterotrophs). Ironically, although fungi 
are considered central to terrestrial decomposi- 
tion, eukaryotic saprotrophs are not represented 
in ocean ecosystem models, despite growing evi- 
dence of diverse marine fungi {74, 75). Recent 
reports indicate that correlations exist between 
fungal populations and total organic carbon, ni- 
trate, sulfide, and dissolved inorganic carbon, in 
anoxic marine pelagic environments and especially 
at the sea floor (75, 77). If ffingi do contribute to ma- 
rine organic matter degradation— an idea that has 
not yet been formally tested— it will be important 
to learn how flexible they are in this role. To what 
extent do they and other eukaryotes that func- 
tion as marine saprotrophs compete for the same 
substrates? Labyrinthulids and thraustochytrids 


Box 1. Evolutionary history of algal endosymbiosis and putative plastid losses. 

Mitochondria and plastids both arose from the endosymbiotic uptake of a bacterium (an 
alphaproteobacterium and a cyanobacterium, respectively), but the subsequent evolution of 
plastids has been complicated by additional endosymbiosis events. The original or “primary” 
plastid that descended directly from the cyanobacterial endosymbiont is still found in the 
archaeplastids (glaucophytes, red algae, green algae, and plants). But green and red algae have 
themselves been taken up by other eukaryotic lineages, resulting in “secondary” plastids 
characterized by the additional membranes and more complex protein-targeting systems present 
in euglenids, chlorarachniophytes, cryptophytes, haptophytes (also referred to herein as 
prymnesiophytes), stramenopiles, dinofiagellates, and apicomplexans. Some members of these 
groups are mixotrophic, and others are purely heterotrophic (predatory, saprotrophic, or even 
parasitic) because photosynthesis or plastids have been lost. Although green algae are common 
in marine environments (e.g., picoprasinophytes such as Bathycoccus, Ostreococcus, and 
Micromonas), lineages resulting from secondary endosymbiotic partnerships include many other 
important marine primary producers (e.g., diatoms, pelagophytes, prymnesiophytes, and 
dinofiagellates) and represent incredible metabolic versatility. It is interesting to speculate that 
the redundancy and reshuffling of characteristics resulting from mergers of distinct eukaryotic 
lineages favors new combinations of traits with strong ecological potential. 
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are stramenopiles hypothesized to primarily use 
terrestrial organic matter present in coastal ma- 
rine habitats, but they have also been reported 
in oligotrophic waters {78). Preliminary experi- 
ments suggest competition with prokaryotes 
{79), but to what extent is this true in nature? Are 
the activities of eukaryotic saprotrophs largely 
restricted to sediments, or do they operate in the 
water column as well? 

Eukaryotes also use endocytosis to ingest high- 
molecular weight and colloidal DOM, rich sources 
of trace metals {80). The Picozoa ingest, then 
process marine colloids of <150 nm diameter in a 
vacuole {81). These colloids are similar in size to 
many marine eukaryotic viruses. Interestingly, 
virus sequences were detected in association with 
a sorted picozoan cell {82) potentially from an 


infecting virus or food. Regardless, the discovery 
of colloid-sized particle ingestion by picozoans has 
important implications for remineralization rates, 
because intracellular processing is presumably 
more efficient than extracellular mechanisms. 

Another means by which heterotrophic pro- 
tists acquire nutrition is parasitism, an efficient 
strategy once the host is encountered. If life in 
oceans is anything like that on land, then for 
every marine animal species there are likely sev- 
eral parasite species. Infective life stages (Fig. 3, 
inset) allow more constant access to higher con- 
centrations of organic material than are encoun- 
tered by most free-living microbes. The most 
abundant sequences in marine protistan diver- 
sity surveys are novel uncultured marine alveolates 
(MALVs) that belong to the Syndiniales {21, 83). 


Several Syndiniales groups are known to be para- 
sitic and can control blooms of other marine 
dinoflagellates or infect ciliates, other protists, 
and animals (83-85). The interplay between the 
life cycles of one such parasite, Amoebophrya, 
and its dinoflagellate host is complicated, with 
infection speeding up host cyst formation in a 
response hypothesized to promote resistance 
to infection {84). Currently, direct evidence that 
most MALV clades are parasitic is lacking. Asso- 
ciations observed between uncharacterized MALVs 
and radiolarians (85) are equally consistent with 
a symbiotic relationship, and environmental fac- 
tors could potentially shift a relationship from 
commensal to pathogenic. For MALVs that are 
parasitic, what taxa do they infect and by what 
mechanism? Is the outcome of infection benign 
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Fig. 2. Protists constitute the majority of lineages across the eukaryotic 
tree of life. This schematic represents a synthesis of information on mor- 
phology, phylogenetic analyses (based on a few genes from a large diversity of 
organisms), as well as phylogenomic analyses (of many genes from represent- 
atives of major lineages). Seven “supergroups” are indicated by colored 
wedges, all of which contain multiple marine protistan lineages. Relationships 
of groups listed outside the supergroups remain contentious. Peripheral pic- 
tures highlight eukaryotic diversity, both microbial and multicellular. Clockwise 


from right: archaeplastids (rhodophyte, chlorophyte, streptophyte): amoebozoa 
(tubulinid, arcellinid, mycetozoan); opisthokonts (fungus, microsporidian, 
choanoflagellate, cnidarian, bilaterian); excavates (parabasalian, oxymonad, 
euglenid): rhizaria (acantharian, foraminiferan, chlorarachniophyte): alveolates 
(ciliate, dinoflagellate): stramenopiles (labyrinthulid, synurophyte, diatom, 
phaeophyte, actinophryid): unassigned [cryptomonad, katablepharid, haptophyte 
(referred to here as prymnesiophytes, and to which coccolithophores belong)]. 
[Phylogenomic analyses adapted from (J5S)] 
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or pathogenic? Do they have complex life cycles, 
reservoir species, or active free-living stages? 

Parasitism presents challenges for ecosystem 
modeling, even if it is an easily interpretable 
nutritional mode. Thus, parasites, like protistan 
saprotrophs, have not been incorporated into eco- 
system and carbon cycle models. What are the 
tradeoffs between protection against predation, 
resource acquisition strategies, and costs lever- 
aged by high abundance that increase encoun- 
ters with parasites and viruses? Some parasites 
also consume bacteria by phagocytosis. For example, 
the human parasite Trichomonas vaginalis has a 
predatory mode that remains “unseen” in its 
genome {87). Hence, the complexity of protistan 
trophic modes underscores the importance of study- 


ing organisms in natural or near-natural milieus 
vhere sundry interaction possibilities are maintained. 

Diverse metabolic pathways facilitate organic 
matter processing in all domains of life, but in eu- 
karyotes the evolution of phagotrophy and para- 
sitism, in addition to osmotrophy and saprotrophy, 
renders several carbon oxidation modes difficult 
to identify (Fig. 3). Phototrophy is easily recog- 
nized by huge, highly conserved macromolecular 
complexes that are found in plastids, the semi- 
autonomous organelles of photosynthetic pro- 
tists (Box 1). In contrast, heterotrophy is a diverse 
amalgam of pathways that overlaps with wide- 
ly distributed anaplerotic pathways that form 
intermediates used in both autotrophic and het- 
erotrophic metabolisms. Moreover, the specific 


mechanics of most types of protistan heterotro- 
phy are shadowy. Genomic and proteomic studies 
could provide candidate genes for cell surface 
properties related to prey sensing {88) or host 
sensing by parasites. Although all aspects of these 
processes are doubtless driven by proteins en- 
coded in nuclear genomes, these are likely to be 
among the -50% of proteins in any given ge- 
nome that have unknown functions {89). Because 
eukaryotic mechanisms relating to heterotrophy 
may have evolved in parallel (in contrast to shared 
ancestry of photosynthetic systems; Box 2), a mas- 
ter list of genes relating to feeding in one pred- 
ator might have limited predictive power for 
other predatory lineages. As a result, we lack a cat- 
alog of genes responsible for the cellular components 
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Fig. 3. Microbes from the three domains of life interact to structure 
ocean ecosystems and the carbon cycle. CO2 fixation into organic matter 
is performed by diverse photosynthetic eukaryotes, as well as cyanobacteria, 
that have different food web linkages. Eukaryotes are involved in many 
interactions, as depicted in inset boxes. Some widespread phytoplankton 
consume other microbes (mixotrophs; see Box 2 ). Eukaryotic parasites are 
another force of mortality that presumably affects many types of eukaryotes 
(with host specificity), as do viral infections. Death by parasite or viral infection 
results in release of a continuum of nondissolvable organic matter (i.e., POM) 
and DOM, a complex suite of polymers and molecules. Programmed cell death 


of protists (apoptosis: background arrow) has also been proposed. Microbes 
contribute to CO2 respiration (e.g., archaea, heterotrophic bacteria, sapro- 
trophs, parasites, mixotrophs, and even photosynthetic taxa at night), which 
can result in CO2 release to the atmosphere and reduces organic carbon export 
to the deep ocean. Some eukaryotes grow in low-nitrogen regions by using 
symbiotic relationships with N2-fixing bacteria (upper left box). The majority of 
protistan roles and linkages are not represented in ecosystem and carbon cycle 
models, and most have not yet been rigorously quantified. Inorganic nutrients 
and direct release of CO2 depicted in Fig. 1 have been removed for simplification 
purposes. [Parasite life cycle adapted from (J 59 )] 
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involved in protistan heterotrophy, such as 
feeding apparatuses, receptors for specific host 
or prey recognition factors, or signaling path- 
ways that govern predatory behavior and trophic 
mode shifts. 

Shifting boundaries between trophic 
modes: Mixotrophy 

In addition to the oversimplification inherent in 
the convenient categories of “producers” (algae) 
and “consumers” (heterotrophs), a larger prob- 
lem is that many protists do not even stick to one 
or the other role. Recognition that mixotrophy— 
a fusion of phototrophy and heterotrophy— is 
important arose in 1986, when seminal research 
on freshwater plankton showed that “...at least 
some natural phytoplankton are phagotrophic 
and apparently obtain a substantial fraction of 
their energy and nutrients by ingesting bacteria 
at rates very similar to those measured for 
nonphotosynthetic microfiagellates” {90). This 
publication was quickly followed by reports on 
mixotrophic marine algae {91), and more recent 
shipboard experiments demonstrated that small 
algae perform 37 to 95% of total bacterivory in 
the Atlantic Ocean {32). 

Phagotrophic capabilities in otherwise photo- 
synthetic organisms have exciting ecological im- 
plications. Prey ingestion may serve as a source 
of energy and carbon when photosynthesis is 
limited by light availability (e.g., deep in the water 
column or under ice) {92). However, in oligotroph- 
ic euphotic waters where uncultured mixotrophic 
piymnesiophytes and chrysophytes (stramenopiles) 
are prominent {31, 93, 94), acquisition of nutri- 
ents needed for photosynthetic growth, not carbon, 
is considered the feeding trigger {32). Through 
prey digestion, demand for scarce inorganic nu- 
trients is lessened because particulate nutrients 
(from the prey cell) support primary production. 


thereby lowering release of remineralized nutri- 
ents relative to heterotrophic predation. In primar- 
ily heterotrophic mixotrophs, respiratory carbon 
needs can be met by photosynthesis, making high 
assimilation efficiencies possible (95). If group- 
specific predation (and photosynthetic inputs to 
growth) can be accurately estimated, the influences 
of dissolved inorganic nutrients and remineral- 
ization rates can be better assessed. 

Little is known about how biotic and abiotic 
factors influence where a protist resides along 
the spectrum between heterotrophic and photo- 
autotrophic growth at any given time, or species- 
level variability. Experiments are difficult, in part 
because most cultured predators, mixotrophic and 
otherwise, reside with prey communities and 
symbionts (if present) that restrict defined ex- 
perimentation. Moreover, many groups are not 
represented in culture. Uncultured piymnesiophytes 
contribute significantly to open-ocean primary 
production {30, 31, 94). Some of these same 
piymnesiophytes are mixotrophic {93, 94) and 
consume Prochlorococcus, the most abundant pho- 
tosynthetic organism on the planet (and presum- 
ably prey upon other picoplankton as well). If such 
oiganisms shift trophic modes opportunistically— 
perhaps to capture resources through predation 
when that strategy becomes more favorable— then 
the environmental triggers behind such a shift 
fundamentally reshape their ecosystem roles and 
food web dynamics. 

Predatory protists, whether mixotrophic or 
strictly heterotrophic, will likely be affected by 
ocean change through direct temperature effects 
and by altered composition of prey. The mixo- 
trophic chiysophyte Ochromonas switches to more 
heterotrophic nutrition under elevated tempera- 
tures, although it remains unknown whether 
higher CO 2 availability reverses this effect {96). A 
shift toward smaller phytoplankton, as observed 


with warming in the Arctic and in ocean acidifi- 
cation experiments, favors nanoflagellate grazers 
over larger predators {50, 97). Altered phytoplank- 
ton stoichiometry (due to increased CO 2 availa- 
bility) also affects copepod consumption rates, 
modifying growth and the fraction of organic car- 
bon converted to CO 2 or excreted as DOM {98). 

There are multiple further complications. For 
example, not all phototrophs rely on canonical 
oxidative photosynthesis. The light-driven pro- 
ton pump proteorhodopsin, known from some 
marine bacteria (99), occurs in photosynthetic 
and heterotrophic eukaryotes {100). Its distribu- 
tion and function are still poorly understood, but 
new copies are being found in additional line- 
ages. Marine proteorhodopsins derived from eu- 
karyotes were recovered in metagenomic surveys 
but were misinterpreted as being bacterially de- 
rived, until they were found and characterized 
in cultivated protists {100). This highlights the 
power of meta-omic approaches as well as the 
important role of reference data from cultured 
taxa in the process of extrapolating function 
from metagenomic data. 

The role of behavior: Ecological 
capabilities beyond carbon processing 

Typically, bacterial and archaeal diversity is mani- 
fested at the level of metabolism, whereas eu- 
karyotic diversity is strongly influenced by their 
cellular structures and, by extension, behavior 
{101, 102). Bacteria and archaea do use sensing 
and motility (chemotaxis) to optimize their loca- 
tion and communicate by means of pheromones 
{103). Nonetheless, our view of how prokaryotes 
contribute to the marine carbon cycle (and food 
webs in general) is dominated by analyzing the sum 
of the enzymes and biochemical pathways they 
express, even when microscale heterogeneity (it- 
self influenced by eukaryotic cells’ activities, death, 
and aggregation) is taken into account {104-106). 
Comparative methods can be used to extrapolate 
these enzymes and pathways from genomic and 
metagenomic data reasonably well, which in turn 
allows hypotheses to be generated on their ecolog- 
ical roles, to be generated laigely from sequence data. 

For eukaryotes, the balance between metabo- 
lism and behavior is tipped in the opposite direction: 
Metabolism maybe diverse, but environmental 
interactions are strongly guided by cellular struc- 
tures and the behaviors they underpin. Protists 
can build traps and feeding apparatuses; they 
not only move about but often actively hunt down 
specific prey (57, 107). They also take actions to 
evade hunters or to build defenses against them. 
Protists actively capture other cells, take them 
up, and digest them or domesticate them (see 
below). These characteristics emerge from pro- 
tein functions and interactions that are several 
degrees further removed from genomic sequence 
data than are enzymatic pathways. As a result, 
we are far less adept at reconstructing cellular 
structures and behavior from sequence data— or, 
by extension, the role of diverse eukaryotic mi- 
crobes in the marine carbon cycle. 

Of course there are exceptions; many eukar- 
yotes take advantage of metabolic versatility much 


Box 2. Lifestyles and specialized interaction zones. 

(i) Primary producer: Generates organic carbon by photosynthesis and CO 2 fixation 
(the role traditionally played by phytoplankton). 

(ii) Osmotrophy: Cells take up organic material from the external environment as small 
molecules or macromolecules. 

(iii) Saprotrophy: A mode of osmotrophy whereby extracellular digestion is involved in 
processing or recycling organic matter, also often referred to as chemoheterotrophy. 

(iv) Predation: Consumption of prey cells, often by phagocytosis. 

(v) Mixotrophic predation: A combination of phototrophic and heterotrophic metabolism. 

In some cases, mixotrophs may alternate between trophic mechanisms (e.g., predation, 
which can result in release of CO 2 , and photosynthesis, resulting in capture of CO 2 ). 

(vi) Parasitism: Defined here as one organism existing in association with another to the 
detriment of that partner (Fig. 3 inset shows a dinoflagellate infected by Amoebophrya). 

(vii) Phycosphere: A region surrounding phytoplankton cells that results from the basic 
physics of the diffusive boundary layer and has a higher concentrations of organic matter 
(produced by the eukaryote) than local waters. This zone attracts free-living prokaryotes, 
some of which may attach (and could also include other eukaryotes). The zone of influence 
in the fluid medium is disproportionately greater for larger phytoplankton cells and is considered 
akin to the rhizosphere (Fig. 3 inset depicts a diatom phycosphere containing bacteria). 

(viii) Symbioses: Defined here as mutualistic relationships where one species lives on or within 
another species (Fig. 3 inset shows a diatom with N 2 -fixing cyanobacteria on its spines). 
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as bacteria do (e.g., fungi), and some ecologically 
important activities are readily interpreted from 
genomic data (e.g., photosynthesis). But, as we have 
seen above, we lack the catalog of gene functions 
needed to recognize many trophic behaviors, or 
the mechanisms that control shifts between trophic 
modes. This problem is particularly acute for free- 
living protistan heterotrophs and mixotrophs, 
many of which are the most structurally complex 
but least investigated microbial cells. 

Complexity through associations: 
Phycosphere, symbioses, and viruses 

Heterogeneities in carbon pools will also be af- 
fected by physical associations, some of which 
are mutualistic. Bacteria (and potentially archaea 
and protists) actively detect and take up exuded 
extracellular products in the zone around phyto- 
plankton known as the phycosphere (Box 2 and 
Fig. 3) {106, 108-110). The associated bacteria can 
be attached, or motile and unattached, but so far 
phycosphere interactions and the molecules ex- 
changed have proven difficult to discern. 

Symbiotic associations involving protists are 
also well known, although overlooked in micro- 
bial food web dynamics. Many involve metabo- 
lite exchanges, sometimes where the protist is a 
symbiont within an animal (e.g., algae in corals), 
or where heterotrophic protists play host to bacte- 
rial or eukaryotic algal symbionts, or algal or- 
ganelles retained by Meptoplastidy {111-114). Thus, 
mixotrophy is not restricted to phytoplankton 
capable of phagocytosis, but is also widespread 
among heterotrophic ciliates, dinoflagellates, fo- 
raminifera, and acantharians {59, 115). Obligate 
N 2 -fixing cyanobacterial symbionts are present in 
some diatoms, dinoflagellates, and piymnesiophytes 
{116-118), although destructive sampling of the 
latter initially led to the mistaken conclusion that 
the symbionts were free-living cells, thus empha- 
sizing the value of visual observation. These sym- 
bioses allow goods exchanges, including newly 
fixed N, that represent alternative resource ac- 
quisition strategies for the hosts. They thereby 
stimulate productivity and influence overall nitro- 
gen availability {119). 

Viruses play important but unquantified roles 
in structuring microbial communities and inter- 
rupt flows of carbon and nutrients through the 
microbial loop {10, 120-122). Labile DOM gener- 
ated through viral-mediated cell lysis can be rap- 
idly assimilated and remineralized by microbes, 
promoting nutrient retention in the euphotic zone 
and potentially decreasing the efficiency of car- 
bon export to the deep sea {121-123). However, 
infection of photosynthetic protists can also en- 
hance production of exudates, which stimulate 
particle aggregation and facilitate export {123, 124). 
Notably, viruses can influence predator popula- 
tions directly by infection and lysis or by reduc- 
ing prey availability {122, 125). Indirect effects 
involve increased bacterial growth on released 
DOM and POM, resulting in community changes. 
Moreover, viruses themselves may serve as food 
particles. 

Like their hosts, fundamental differences be- 
tween eukaryotic viruses and phages necessitate 


separate empirical methods for characterization 
and explicit inclusion in marine food webs (Fig. 3). 
Some eukaryotic viruses appear to exploit mech- 
anisms of programmed cell death during lysis 
{124), and some may promote sexual cycling in 
species where ploidy level determines viral sus- 
ceptibility {126), but this and other resistance 
mechanisms are not well understood {127). An 
extreme example of how marine eukaryotic vi- 
ruses differ from phages is the giant virus that in- 
fects a predatory stramenopile alongside a smaller 
‘Virophage” that parasitizes the giant virus {128). 
The diversity of protists and their viruses may 
preclude broad commonalities for several as- 
pects of interactions {121). Challenges moving 
forward include linking natural viruses (and viral 
sequences) to actual hosts, examining viruses as 
food particles, and incorporating the nuances 
of species-specific viral interactions in ecosystem 
models. 

Diverse data for a complex problem 

An overwhelming volume of sequence data is be- 
ing collected and analyzed from marine microbes. 
Its value for characterizing protistan contribu- 
tions to autotrophic and heterotrophic function- 
ality, and teasing apart their interactions with 
other microbes and consequent influences on 
biogeochemical cycles, will be maximized by 
tools and approaches tailored to the unique biol- 
ogy of protists. Statistical approaches that detect 
interactions between organisms using co-occurrence 
networks can help development of hypotheses 
about the ecology of eukaryotic plankton {129, 130). 
Massive sequencing of marker genes (or even 
metagenomic and metatranscriptomic sequen- 
ces) provides such data, with the caveat that for 
success large numbers of samples are needed, 
as are experimental designs that adequately 
cover key environmental variables and involve 
rate measurements. 

Although rare, studies designed to capture 
physical associations between cells provide crit- 
ical avenues for validating ecological interaction 
networks. For example, genome reconstruction 
can help to identify traded services, the hallmark 
of symbioses. Thus, “single cell” sequencing with 
enough coverage to highlight key metabolic path- 
ways absent from one but present in the other 
partner can provide evidence of symbiosis. A ro- 
bust example with relevance to global geochemical 
models is N 2 fixation by symbionts {116-118). 
However, in cases involving new and unknown 
interactions, sequencing alone may not resolve 
the true nature of interactions, but rather can 
lead to (mis)interpretation as symbiosis, contam- 
ination, predation, presence of osmotrophically 
degraded material, infection, or even a horizon- 
tal gene transfer {131). 

A few “rules” that have ensured success in 
meta-omic studies are not as readily met by pro- 
tists. These are (i) having a reference genome to 
help recruit relevant sequences [not the case for 
most marine protists because only ~20 genomes 
are available]; (ii) having a small genome and 
sufficient sequencing depth [genome sizes from 
free-living protists vary from ~10 MB to much 


larger than the human genome]; and (iii) having 
low system diversity (i.e., few taxa present)— a 
situation sometimes true in extreme environments, 
but not commonly so. Eukaryotes present other 
types of genome complexity as well, such as re- 
gions of widely varied composition that compli- 
cate binning without a reference genome {89) 
and “mosaic” phylogenetic signals that require 
deciphering complex evolutionary histories 
{2, 31, 43, 44, 132). Transcriptome sequencing 
and assembly has improved protistan reference 
databases while minimizing genome structural 
complexity issues {133). Another strategy used to 
sidestep such issues for wild planktonic protists 
is cell sorting based on optical signatures, which 
raises coverage of target organisms by suppressing 
diversity, making genome assembly more likely 
{26, 31, 62, 82, 134-136). Microfluidic approaches 
that sort cells by imaging and thereby link ge- 
nomes to morphology will further surmount these 
problems {136). Higher-throughput applications 
of these technologies are needed to expand ge- 
nomic representation of protistan complexity. 

Biology has much to gain from expanded ge- 
nomic coverage of protistan diversity, especially 
as a broader range of trophic strategy variants is 
investigated. Marine protistan genomes sequenced 
thus far have resulted in massive strides in under- 
standing the origins of multicellularity {137, 138), 
intron evolution {139, 140), genome integration 
and control {43), and even fundamental aspects 
of methylation, nucleosome positioning, and chro- 
matin compaction {141). Major gains will also be 
made for marine phytoplankton, given that the 
evolutionary distance between the two genome- 
sequenced diatom genera {Thalassiosira and 
Phaeodactylum) is greater than that between 
humans and fish {132), and marker analysis of 
different clades within the prasinophyte genus 
Micromonas shows divergence equivalent to the 
separation between com and rice {139, 142). 

Still, genomics has limits. So, as questions 
about the carbon cycling activities of protists are 
formulated in greater detail, we need to ask: Where 
are new strategies needed? One such area involves 
functional studies on ecologically important model 
organisms in environmentally relevant conditions. 
For example, omic data facilitated the discovery 
that microbes previously considered dependent 
on exogenous vitamin Bi grow in its absence, but 
only after pathway gap theory {143) was imple- 
mented together with experiments on cultured 
species {144, 145). What was thought to represent 
(incomplete) pathways in the evolutionary process 
of reduction is actually a noncanonical vitamin 
pathway dependent on the import of a precursor 
that can be more abundant in seawater than the 
vitamin itself. Once understood in a few cultured 
organisms, the B^ (thiamine) metabolism of di- 
verse planktonic bacteria and eukaryotic phyto- 
plankton could be reinterpreted {144, 145). These 
discoveries change “facts” regarding phytoplank- 
ton bloom controls and reveal the likely involve- 
ment of several proteins of unknown function in 
thiamine biosynthesis. Such studies illustrate that 
the power of comparative approaches is contin- 
gent on reference data, but more importantly that 
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the greatest gain is harvested from omic data sets 
when paired ^vith detailed studies on the biology 
of strategically chosen marine taxa. 

One of the biggest obstacles to interpreting 
protistan genomes is the dearth of information 
about the evolution of proteins that control behav- 
iors such as mixotrophy, predation, and parasi- 
tism, as v^ell as regulatory strategies— for example, 
those that control energy allocation (146) to 
optimize growth. The recent discovery of exten- 
sive oxidative metabolism for one-carbon and 
methylated compounds in the hyperabundant 
Pelagibacterales clade of marine bacteria illustrates 
how geochemically important information lay 
v^thin plain view in genomes for a decade before 
being deciphered (14T). This discovery focused 
attention on a neglected but apparently impor- 
tant sector of the carbon cycle: methylated and 
volatile compounds. This brings us to a second 
answer to the ‘Where are new strategies needed?” 
question. A great deal can be garnered from 
pathway gap theory, but in the end, studies that 
infer higher-order function from genomes are 
founded on experimental proof of protein func- 
tions. One of the biggest impediments to systems 
biology and ecosystems biology (104, 148) ap- 
proaches is that the majority of important marine 
protistan groups (even algae) lack representatives 
with established genetic systems. Robust but rela- 
tively low-efficiency manipulation systems exist 
for two algae v^th limited marine distributions: 
the diminutive picoeukaryote Ostreococciis tauri 

(149) and the diatom Phaeodactylum trimmutum 

(150) . New knowledge on marine protistan diver- 
sity and distributions calls for concerted initia- 
tives to develop genetic systems that taiget ecologically 
relevant lineages from across the eukaryotic 
tree, and to develop parallel technologies such as 
CRISPR (clustered regularly interspaced short 
palindromic repeat) and TALEN (transcription 
activator-like effector nuclease)-based gene edit- 
ing. These, as well as sampling that preserves 
microbe-microbe interactions, improved visualiza- 
tion, and experiments in advanced bioreactors 
that allow testing of multifactorial impacts, precise 
modulation of CO 2 , light, and nutrients, and real- 
time measurements of physiological parameters, 
are key steps forward. 

Ecological and biogeochemical modeling 

Rapidly advancing bioinformatic and statistical 
approaches must be complemented by mecha- 
nistic models that encapsulate and synthesize 
understanding of complex systems at cellular, 
community, and global scales. Marine protistan 
diversity presents challenges for theory and mod- 
eling: Can we understand the rich and flexible 
ecology of these functionally diverse organisms 
to interpret and simulate it with mathematical 
and numerical models? What facets of their 
biology affect carbon flow and sequestration 
sufficiently to warrant inclusion in climate and 
carbon cycle models? Consider mixotrophy: Now 
that field studies show its prevalence [e.g., (32, 94)1, 
should global carbon cycle models that currently 
only separate autotrophy and heterotrophy cap- 
ture this complexity? To date, the costs and ben- 


efits of mixotrophic predation have been addressed 
by few studies (151-153). This emphasizes the 
need to evaluate trophic strategies v^th data and 
modeling, explore their biogeochemical impor- 
tance and food web impacts, and assess how they 
modify predictions by global models. 

Modeling paradigms generally resolve a few 
coarse phytoplankton phenotypes ^vith highly sim- 
plified carbon and nutrient flows between them. 
Currently, diversity ^vithin a few functional types 
related to cell size or temperature adaptations 
can also be resolved (14). Yet many chemical ex- 
changes occur between protists and their neigh- 
bors; are these simplified descriptions sufficient 
to describe the system? Of particular interest is 
how diversity and connectivity affect system 
stability— a question long debated in the context 
of food webs (154). The ability to predict how 
plankton populations and their biogeochemical 
roles respond to a changing environment is tight- 
ly linked to this question. For example, in a recent 
five-box model, the relative abundance of fast- 
and slow-grovdng phytoplankton taxa controlled 
community elemental composition and new nitro- 
gen import rates (155). In another study, inclusion 
of contributions from various system members 
(including microbes) helped to reconcile aphotic 
zone flux terms that were long out of balance 
(156). These studies highlight the need for im- 
proved input data (even relating to the basic el- 
emental stoichiometry of different algae), but also 
demonstrate that idealized models can be tools 
for exploring how complex networks behave and 
for generating hypotheses that can be tested in 
the lab and field. 

The black-box descriptions of physiology used 
in current ecosystem models neglect the power- 
ful constraints of redox and energy balance that 
become explicit once cellular metabolism is re- 
solved. Environmental engineers have exploited 
these constraints using schematic metabolic net- 
works. Flux balance analysis (157), a tool from 
systems biology, extends this approach to provide 
mechanistic, quantitative predictions of grov^h 
rates and efficiencies rooted in genome-scale meta- 
bolic reconstructions. Such a goal requires consid- 
erable advances in protein function annotations, 
as well as rigorous testing with model organisms. 
However, looking forward, the application of flux 
balance analysis in models of marine ecosystems 
^vill enable a vastly richer repertoire of applica- 
tions and v^ll bring empirical and theoretical 
perspectives together ^vith a common language. 

Conclusions 

The complexity of protistan behaviors v^ll test 
the perspective that mechanistic models of cell 
functions can be merged with global models to 
understand marine ecosystems. How scientists 
v^ll ^vinnow this complexity to find those kernels 
of protistan biology most important for ocean 
ecosystem-level processes and carbon cycling re- 
mains to be seen. The diverse biology of protists 
is not yet understood well enough to fully eval- 
uate their whole impact on biogeochemical mod- 
els, but enough is knovm to say their roles must 
be reevaluated. Protistan genomes and behaviors 


are innately challenging to decipher but are es- 
sential to dissecting trophic connections. Tech- 
nologies that overcome scale-related issues of 
sequencing and proteome analyses are helping, 
as is the application of network theory and mod- 
eling to identify ecological links and generate 
testable hypotheses. As hypothetical links are 
identified, the challenge is to validate them by 
characterizing the underlying mechanisms, con- 
strain them by empirical measurements, and 
parameterize them into global biogeochemical 
models. The most direct source of data on troph- 
ic modes and interactions will come from study- 
ing cells at multiple levels, integrating cell and 
structural biology, physiology, and behavior. Com- 
bining investigations of cellular systems with a 
broader sampling of eukaryotic diversity will 
empower comparative approaches ^vith the prom- 
ise of substantial progress in quantifying activ- 
ities of protists and their impacts on the global 
carbon cycle. 
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INTRODUCTION: There is an urgent need for 
a new paradigm that integrates the continued 
development of human societies and the main- 
tenance of the Earth system (ES) in a resilient 
and accommodating state. The planetary bound- 
ary (PB) framework contributes to such a 
paradigm by providing a science-based analysis 
of the risk that human perturbations will de- 
stabilize the ES at the planetary scale. Here, the 
scientific underpinnings of the PB framework 
are updated and strengthened. 

RATIONALE: The relatively stable, 11,700-year- 
long Holocene epoch is the only state of the ES 


that we know for certain can support contem- 
porary human societies. There is increasing evi- 
dence that human activities are affecting ES 
functioning to a degree that threatens the re- 
silience of the ES— its ability to persist in a 
Holocene-like state in the face of increasing 
human pressures and shocks. The PB frame- 
work is based on critical processes that reg- 
ulate ES functioning. By combining improved 
scientific understanding of ES functioning with 
the precautionary principle, the PB framework 
identifies levels of anthropogenic perturbations 
below which the risk of destabilization of the 
ES is likely to remain low— a “safe operating 


Climate change 
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divers' 


Land-system 

change 


Freshwater use 
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Biochemical flows 
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Novel entities 


Stratospheric ozone depletion 


Atmospheric aerosol loading 
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■ Beyond zone of uncertainty (high risk) B Below boundary (safe) 

■ In zone of uncertainty (increasing risk) Boundary not yet quantified 


Current status of the control variables for seven of the planetary boundaries. The green zone 
is the safe operating space, the yellow represents the zone of uncertainty (increasing risk), and the 
red is a high-risk zone. The planetary boundary itself lies at the intersection of the green and yellow 
zones. The control variables have been normalized for the zone of uncertainty: the center of the 
figure therefore does not represent values of 0 for the control variables. The control variable shown 
for climate change is atmospheric CO 2 concentration. Processes for which global-level boundaries 
cannot yet be quantified are represented by gray wedges: these are atmospheric aerosol loading, 
novel entities, and the functional role of biosphere integrity. 


space” for global societal development. A zone 
of uncertainty for each PB highlights the area 
of increasing risk. The current level of anthro- 
pogenic impact on the ES, and thus the risk to 
the stability of the ES, is assessed by compar- 
ison with the proposed PB (see the figure). 


RESULTS: Three of the PBs (climate change, 
stratospheric ozone depletion, and ocean acid- 
ification) remain essentially unchanged from 
the earlier analysis. Regional-level boundaries 
as well as globally aggregated PBs have now 
been developed for biosphere integrity (earlier 
“biodiversity loss”), biogeochemical flows, land- 
system change, and flreshwater use. At present, 
only one regional boundary (south Asian mon- 
soon) can be established for atmospheric aerosol 
loading. Although we cannot identify a single PB 
for novel entities (here de- 
fined as new substances, 
new forms of existing sub- 
stances, and modified life 
forms that have the po- 
tential for unwanted geo- 
physical and/or biological 
effects), they are included in the PB framework, 
given their potential to change the state of the 
ES. Two of the PBs— climate change and bio- 
sphere integrity— are recognized as “core” PBs 
based on their fimdamental importance for the 
ES. The climate system is a manifestation of the 
amount, distribution, and net balance of energy 
at Earth’s surface; the biosphere regulates ma- 
terial and energy flows in the ES and increases 
its resilience to abrupt and gradual change. 
Anthropogenic perturbation levels of four of 
the ES processes/features (climate change, bio- 
sphere integrity, biogeochemical flows, and land- 
system change) exceed the proposed PB (see the 
flgure). 
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CONCLUSIONS: PBs are scientifically based 
levels of human perturbation of the ES beyond 
which ES functioning may be substantially 
altered. Transgression of the PBs thus creates 
substantial risk of destabilizing the Holocene 
state of the ES in which modern societies have 
evolved. The PB framework does not dictate 
how societies should develop. These are po- 
litical decisions that must include considera- 
tion of the human dimensions, including equity, 
not incorporated in the PB firamework. Never- 
theless, by identifying a safe operating space 
for humanity on Earth, the PB framework 
can make a valuable contribution to decision- 
makers in charting desirable courses for socie- 
tal development. ■ 
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The planetary boundaries framework defines a safe operating space for humanity based 
on the intrinsic biophysical processes that regulate the stability of the Earth system. 
Here, we revise and update the planetary boundary framework, with a focus on the 
underpinning biophysical science, based on targeted input from expert research 
communities and on more general scientific advances over the past 5 years. Several of the 
boundaries now have a two-tier approach, reflecting the importance of cross-scale 
interactions and the regional-level heterogeneity of the processes that underpin the 
boundaries. Two core boundaries— climate change and biosphere integrity— have been 
identified, each of which has the potential on its own to drive the Earth system into a new 
state should they be substantially and persistently transgressed. 


T he planetary boundary (PB) approach (i, 2) 
aims to define a safe operating space for 
human societies to develop and thrive, based 
on our evolving understanding of the func- 
tioning and resilience of the Earth system. 
Since its introduction, the framework has been 
subject to scientific scrutiny [e.g., (3-7)] and has 
attracted considerable interest and discussions 
within the policy, governance, and business sec- 
tors as an approach to inform efforts toward glob- 
al sustainability {8-10). 

In this analysis, we further develop the basic 
PB framework by (i) introducing a two-tier ap- 
proach for several of the boundaries to account 
for regional-level heterogeneity; (ii) updating the 
quantification of most of the PBs; (iii) identifying 
two core boundaries; and (iv) proposing a regional- 
level quantitative boundary for one of the two 
that were not quantified earlier (i). 

The basic framework: Defining 
a safe operating space 

Throughout history, humanity has faced environ- 
mental constraints at local and regional levels, 
with some societies dealing with these challenges 
more effectively than others {11, 12). More recent- 
ly, early industrial societies often used local water- 
ways and airsheds as dumping grounds for their 
waste and effluent from industrial processes. This 
eroded local and regional environmental quality 
and stability, threatening to undermine the pro- 
gress made through industrialization by damag- 
ing human health and degrading ecosystems. 
Eventually, this led to the introduction of local 
or regional boundaries or constraints on what 


could be emitted to and extracted from the en- 
vironment (e.g., chemicals that pollute airsheds 
or waterways) and on how much the environment 
could be changed by direct human modification 
(land-use/cover change in natural ecosystems) 
{13). The regulation of some human impacts on 
the environment— for example, the introduction 
of chemical contaminants— is often framed in 
the context of “safe limits” {14). 

These issues remain, but in addition we now 
face constraints at the planetary level, where the 
magnitude of the challenge is vastly different. 
The human enterprise has grown so dramatically 
since the mid-20th century (15) that the relatively 
stable, 11,700-year-long Holocene epoch, the only 
state of the planet that we know for certain can 
support contemporary human societies, is now 
being destabilized (figs. SI and S2) {16-18). In 
fact, a new geological epoch, the Anthropocene, 
has been proposed {19). 

The precautionary principle suggests that hu- 
man societies would be unwise to drive the Earth 
system substantially away from a Holocene-like 
condition. A continuing trajectory away from the 
Holocene could lead, with an uncomfortably high 
probability, to a very different state of the Earth 
system, one that is likely to be much less hos- 
pitable to the development of human societies 
{17, 18, 20). The PB framework aims to help guide 
human societies away from such a trajectory by 
defining a “safe operating space” in which we can 
continue to develop and thrive. It does this by 
proposing boundaries for anthropogenic pertur- 
bation of critical Earth-system processes. Respect- 
ing these boundaries would greatly reduce the 


risk that anthropogenic activities could inadver- 
tently drive the Earth system to a much less hos- 
pitable state. 

Nine processes, each of which is clearly being 
modified by human actions, were originally sug- 
gested to form the basis of the PB framework {!). 
Although these processes are fundamental to 
Earth-system functioning, there are many other 
ways that Earth-system functioning could be de- 
scribed, including potentially valuable metrics 
for quantifying the human imprint on it. These 
alternative approaches [e.g., (4)] often represent 
ways to explore and quantify interactions among 
the boundaries. They can provide a valuable com- 
plement to the original approach {!) and further 
enrich the broader PB concept as it continues to 
evolve. 

The planetary boundary 
framework: Thresholds, feedbacks, 
resilience, uncertainties 

A planetary boundary as originally defined {!) is 
not equivalent to a global threshold or tipping 
point. As Fig. 1 shows, even when a global- or 
continental/ocean basin-level threshold in an 
Earth-system process is likely to exist [e.g., {20, 2i)], 
the proposed planetary boundary is not placed 
at the position of the biophysical threshold but 
rather upstream of it— i.e., well before reaching 
the threshold. This buffer between the boundary 
(the end of the safe operating space, the green 
zone in Fig. 1) and the threshold not only ac- 
counts for uncertainty in the precise position of 
the threshold with respect to the control variable 
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Process X 


Process Y 


Globally mixed, with continental/global threshold 


Heterogeneous, with no continental/global threshold 


Planetary Planetary 



m Safe operating space |||| Zone of uncertainty: Increasing risk of impacts Dangerous level: High risk of serious impacts 

Fig. 1. The conceptual framework for the planetary boundary approach, showing the safe operating space, the zone of uncertainty, the position of 
the threshold (where one is likely to exist), and the area of high risk. Modified from (1). 


but also allows society time to react to early warn- 
ing signs that it may be approaching a thresh- 
old and consequent abrupt or risky change. 

The developing science of early-warning signs 
can warn of an approaching threshold or a de- 
crease in the capability of a system to persist 
under changing conditions. Examples include 
“critical slowing down” in a process (22), in- 
creasing variance (23), and flickering between 
states of the system (24-26). However, for such 
science to be useful in a policy context, it must 
provide enough time for society to respond in 
order to steer away from an impending thresh- 
old before it is crossed (27, 28). The problem of 
system inertia— for example, in the climate sys- 
tem (iS)— needs to be taken into account in as- 
sessing the time needed for society to react to 
early-warning signs. 

Not all Earth-system processes included in the 
PB approach have singular thresholds at the global/ 
continental/ocean basin level (1). Nevertheless, it 
is important that boundaries be established for 
these processes. They affect the capacity of the 
Earth system to persist in a Holocene-like state 
under changing conditions (henceforth “resilience”) 
by regulating biogeochemical flows (e.g., the ter- 
restrial and marine biological carbon sinks) or by 
providing the capacity for ecosystems to tolerate 
perturbations and shocks and to continue func- 
tioning under changing abiotic conditions (29, 30). 
Examples of such processes are land-system 
change, freshwater use, change in biosphere in- 
tegrity [rate of biodiversity loss in (1, 2)], and 
changes in other biogeochemical flows in addi- 
tion to carbon (e.g., nitrogen and phosphorus). 
Placing boundaries for these processes is more 


difficult than for those with known large-scale 
thresholds (21) but is nevertheless important for 
maintaining the resilience of the Earth system as 
a whole. As indicated in Fig. 1, these processes, 
many of which show threshold behavior at local 
and regional scales, can generate feedbacks to 
the processes that do have large-scale thresholds. 
The classic example is the possible weakening of 
natural carbon sinks, which could further de- 
stabilize the climate system and push it closer to 
large thresholds [e.g, loss of the Greenland ice 
sheet (iS)]. An interesting research question of 
relevance to the PB framework is how small- 
scale regime shifts can propagate across scales 
and possibly lead to global-level transitions (31, 32). 

A zone of uncertainty, sometimes large, is as- 
sociated with each of the boundaries (yellow zone 
in Fig. 1). This zone encapsulates both gaps and 
weaknesses in the scientific knowledge base and 
intrinsic uncertainties in the functioning of the 
Earth system. At the “safe” end of the zone of un- 
certainty, current scientific knowledge suggests 
that there is very low probability of crossing a 
critical threshold or substantially eroding the re- 
silience of the Earth system. Beyond the “danger” 
end of the zone of uncertainty, current knowl- 
edge suggests a much higher probability of a 
change to the functioning of the Earth system 
that could potentially be devastating for human 
societies. Application of the precautionary prin- 
ciple dictates that the planetary boundary is set 
at the “safe” end of the zone of uncertainty. This 
does not mean that transgressing a boundary will 
instantly lead to an unwanted outcome but that 
the farther the boundary is transgressed, the 
higher the risk of regime shifts, destabilized sys- 


tem processes, or erosion of resilience and the 
fewer the opportunities to prepare for such 
changes. Observations of the climate system show 
this principle in action by the influence of in- 
creasing atmospheric greenhouse gas concentra- 
tions on the frequency and intensity of many 
extreme weather events (17, 18). 

Linking global and regional scales 

PB processes operate across scales, from ocean 
basins/biomes or sources/sinks to the level of the 
Earth system as a whole. Here, we address the 
subglobal aspects of the PB framework. Rock- 
strdm et al. (1) estimated global boundaries on- 
ly, acknowledging that the control variables for 
many processes are spatially heterogeneous. That 
is, changes in control variables at the subglobal 
level can influence functioning at the Earth- 
system level, which indicates the need to define 
subglobal boundaries that are compatible with 
the global-level boundary definition. Avoiding 
the transgression of subglobal boundaries would 
thus contribute to an aggregate outcome within 
a planetary-level safe operating space. 

We focus on the five PBs that have strong re- 
gional operating scales: biosphere integrity, biogeo- 
chemical flows [earlier termed “phosphorus (P) 
and nitrogen (N) cycles” (1, 2)'], land-system change, 
freshwater use, and atmospheric aerosol loading. 
Table SI describes how transgression of any of 
the proposed boundaries at the subglobal level 
affects the Earth system at the global level. 

For those processes where subglobal dynamics 
potentially play a critical role in global dynamics, 
the operational challenge is to capture the im- 
portance of subglobal change for the functioning 
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of the Earth system. To do this, we propose the 
development of a two-level set of control var- 
iables and boundaries. The subglobal-level units 
of analysis for these six boundaries are not 
identical; they vary according to the role that the 
processes play in the Earth system: (i) changes 
in biosphere integrity occur at the level of land- 
based biomes, large freshwater ecosystems, or 
major marine ecosystems as the largest sub- 
global unit; (ii) the role of direct, human-driven 
land-system change in biophysical climate regu- 
lation is primarily related to changes in forest 
biomes; (iii) freshwater flows and use occur at 
the largest subglobal level in the major river 
basins around the world; and (iv) changes in 
biogeochemical flows, exemplified by phospho- 
rus and nitrogen cycling, aggregate from rela- 
tively localized but very severe perturbations 
in intensive agricultural zones to affect global 
flows of nutrients. We recognize these as crit- 
ical regions for Earth-system functioning. Where 
appropriate, the updates of the individual bound- 
aries (see below) {33) now contain both the glob- 
ally aggregated boundary value of the control 
variable and its regional distribution function. 
Figure 2 shows the distributions and current 
status of the control variables for three of the 
boundaries where subglobal dynamics are crit- 


ical: biogeochemical cycles, land-system change, 
and freshwater use. 

We emphasize that our subglobal-level focus is 
based on the necessity to consider this level to 
understand the functioning of the Earth system 
as a whole. The PB framework is therefore meant 
to complement, not replace or supersede, efforts 
to address local and regional environmental issues. 

Updates of the individual boundaries 

Brief updates of all nine of the PBs are given in 
this section, and more detailed descriptions of 
the updates for three of the PBs that have under- 
gone more extensive revision can be found in {33). 
The geographical distribution issues discussed 
above are particularly important for five of the 
PBs, and their control variables and boundaries 
have been revised accordingly (Table 1). Figure 3 
shows the current status of the seven bounda- 
ries that can be quantified at the global level. 

Climate change 

We retain the control variables and boundaries 
originally proposed— i.e., an atmospheric CO 2 con- 
centration of 350 parts per million (ppm) and an 
increase in top-of-atmosphere radiative forcing of 
+1.0 W m“^ relative to preindustrial levels {!). 
The radiative forcing control variable is the more 


inclusive and fundamental, although CO 2 is im- 
portant because of its long lifetime in the atmo- 
sphere and the very large human emissions. 
Human-driven changes to radiative forcing in- 
clude all anthropogenic factors: CO 2 , other green- 
house gases, aerosols, and other factors that 
affect the energy balance {18). Radiative forcing 
is generally the more stringent of the two bound- 
aries, although the relationship between it and 
CO 2 can vary through time with changes in the 
relative importance of the individual radiative 
forcing factors. 

Evidence has accumulated to suggest that the 
zone of uncertainty for the CO 2 control variable 
should be narrowed from 350 to 550 ppm to 350 
to 450 ppm CO 2 {17, 18), while retaining the cur- 
rent zone of uncertainty for radiative forcing of 
+1.0 to 1.5 W m“^ relative to preindustrial levels. 
Current values of the control variables are 399 ppm 
CO 2 (annual average concentration for 2014) {34) 
and +2.3 W m“^ (1.1 to 3.3 W m"^) in 2011 relative 
to 1750 {18). Observed changes in climate at cur- 
rent levels of the control variables confirm the 
original choice of the boundary values and the 
narrowing of the zone of uncertainty for CO 2 . For 
example, there has already been an increase in 
the intensity, frequency, and duration of heat 
waves globally (35); the number of heavy rainfall 


A Phosphorus 



C Land-system change 




-Mir 

\ f'V 




B Nitrogen 



D Freshwater use 



■ Beyond zone of uncertainty (high risk) ■ In zone of uncertainty (increasing risk) ■ Below boundary (safe) 

Fig. 2. The subglobal distributions and current status of the control variables for (A) biogeochemical flows of P; (B) biogeochemical flows of N; (C) land- 
system change: and (D) freshwater use. In each panel, green areas are within the boundary (safe), yellow areas are within the zone of uncertainty (increasing 
risk), and red areas are beyond the zone of uncertainty (high risk). Gray areas in (A) and (B) are areas where P and N fertilizers are not applied: in (C), they are 
areas not covered by major forest biomes: and in (D), they are areas where river flow is very low so that environmental flows are not allocated. See Table 1 for 
values of the boundaries and their zones of uncertainty and (33) for more details on methods and results. 
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Table 1. The updated control variables and their current values, along with the proposed boundaries and zones of uncertainty, for all nine planetary 
boundaries. In the first column, the name for the Earth-system process used in the original PB publication (R2009, reference 1) is given for comparison. 

Earth-system Control Planetary boundary Current value of 

process variable(s) (zone of uncertainty) control variable 


Clinnate 

Atmospheric CO 2 

350 ppm CO 2 (350-450 ppm) 

398.5 ppm CO 2 

change 

(R2009: 

concentration, ppm 



same) 

Energy imbalance 

+ 1.0 W m-2 (+1.0-1.5 W m-2) 

2.3 W m-2 


at top-of- 

atmosphere, W m“^ 


(1.1-3.3 W m-2) 

Change in 

Genetic diversity: 

< 10 E/MSY (10-100 E/MSY) 

100-1000 E/MSY 

biosphere 

Extinction rate 

but with an aspirational goal of 


integrity 


ca. 1 E/MSY (the background 


(R2009: 


rate of extinction loss). E/MSY = 


Rate of 


extinctions per million species-years 


biodiversity 

loss) 

Functional diversity: 

Maintain BN at 90% (90-30%) 

84%, applied to 


Biodiversity 

or above, assessed 

southern Africa 


Intactness Index (BN) 

geographically by biomes/large 
regional areas (e.g. southern 

only 


Note: These are 

Africa), major marine 



interim control 

ecosystems (e.g., coral reefs) or 



variables until more 
appropriate ones are 
developed 

by large functional groups 


Stratospheric 

Stratospheric O 3 

<5% reduction from pre- 

Only transgressed 

ozone 

concentration, DU 

industrial level of 290 DU 

over Antarctica i 

depletion 


(5%-10%), assessed by 

Austral spring 

(R2009: same) 


latitude 

(-200 DU) 

Ocean 

Carbonate ion 

>80% of the pre-industrial 

-84% of the 

acidification 

concentration. 

aragonite saturation state of 

pre-industrial 

(R2009: 

average global 

mean surface ocean, including 

aragonite 

same) 

surface ocean 

natural diel and seasonal 

saturation state 


saturation state with 
respect to aragonite 

(^arag) 

variability (>80%- >70%) 


Biogeochemical 

P Global: P flow 

11 Tg P yr-i (11-100 Tg P yr^) 

-22 Tg P yr^ 

flows: (P and 

from freshwater 



N cycles) 

systems into the 



(R2009: 

ocean 



Biogeochemical 
flows: (interference 

P Regional: P flow 

6.2 Tg yr"^ mined and applied to 

-14 Tg P yr-i 

with P and N 

from fertilizers to 

erodible (agricultural) soils 


cycles)) 

erodible soils 

(6.2-11.2 Tg yr“^). Boundary is a 
global average but regional 
distribution is critical for 
impacts. 



N Global: Industrial 

62 Tg N yr^ (62-82 Tg N yr^). 

-150 Tg N yr-i 


and intentional 

Boundary acts as a global 



biological fixation 

‘valve’ limiting introduction of 



of N 

new reactive N to Earth System, 
but regional distribution of 
fertilizer N is critical for 
impacts. 
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Earth-system 

process 


Control 

variable(s) 


Planetary boundary Current value of 

(zone of uncertainty) control variable 


Land-system 

Global: Area of 

Global: 75% (75-54%) Values 

change 

forested land as % 

are a weighted average of the 

(R2009: 

of original forest 

three individual biome 

same) 

cover 

boundaries and their uncertainty 
zones 


Biome: Area of 

8/ome: 


forested land as % 

Tropical: 85% (85-60%) 


of potential forest 

Temperate: 50% (50-30%) 

Boreal: 85% (85-60%) 

Freshwater 

Global: Maximum 

Global: 4000 km^ yr^ 

use 

amount of 

(4000-6000 km^ yr^) 

(R2009: 

consumptive blue 


Global 

water use (km^yr“^) 


freshwater 

use) 

Basin'. Blue water 

Basin: Maximum monthly 


withdrawal as % of 

withdrawal as a percentage 


mean monthly river 

of mean monthly river flow. 


flow 

For low-flow months: 25% 
(25-55%); for intermediate- 
flow months: 30% (30-60%); 
for high-flow months: 55% 
(55-85%) 

Atmospheric 

Global: Aerosol 


aerosol 

Optical Depth 


loading 

(AOD), but much 


(R2009: 

regional variation 


same) 


Regional: AOD as 

Regional: (South Asian 


a seasonal average 

Monsoon as a case study): 


over a region. South 

anthropogenic total (absorbing 


Asian Monsoon 

and scattering) AOD over 


used as a case study 

Indian subcontinent of 0.25 
(0.25-0.50): absorbing 
(warming) AOD less than 10% 
of total AOD 

Introduction 

No control variable 

No boundary currently 

of novel entities 

currently defined 

identified, but see boundary 

(R2009: Chemical 


for stratospheric ozone for an 

pollution) 


example of a boundary 
related to a novel entity (CFOs) 


62% 


-2600 km^ yr ^ 


0.30 AOD, over 
South Asian 
region 


events in many regions of the world is increasing 
{17)', changes in atmospheric circulation patterns 
have increased drought in some regions of the 
world {17)', and the rate of combined mass loss 
from the Greenland and Antarctic ice sheets is 
increasing {36). 

Changes in biosphere integrity 

We propose a two-component approach, address- 
ing two key roles of the biosphere in the Earth 
system. The first captures the role of genetically 
unique material as the “information bank” that 
ultimately determines the potential for life to 


continue to coevolve with the abiotic component 
of the Earth system in the most resilient way 
possible. Genetic diversity provides the long-term 
capacity of the biosphere to persist under and 
adapt to abrupt and gradual abiotic change. The 
second captures the role of the biosphere in 
Earth-system fimctioning through the value, range, 
distribution, and relative abundance of the func- 
tional traits of the organisms present in an eco- 
system or biota (7). 

For the first role, the concept of phylogenetic 
species variability (PSV) (7, 33, 37) would be an 
appropriate control variable. However, because 


global data are not yet available for PSV, we re- 
tain the global extinction rate as an interim con- 
trol variable, although it is measured inaccurately 
and with a time lag. There may be a considerable 
risk in using extinction rate as a control variable, 
because phylogenetic (and functional) diversity 
may be more sensitive to human pressures than 
species-level diversity {38). In principle, the bound- 
aiy should be set at a rate of loss of PSV no greater 
than the rate of evolution of new PSV during the 
Holocene. Because that is unknown, we must fall 
back on the (imperfectly) known extinction rate 
of well-studied organisms over the past several 
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Climate change 

Biosphere 

integrity diversity Novel entities 


Stratospheric 
ozone depletion 


Atmospheric aerosol 
loading 


Nitrogen Ocean acidification 

Biogeochemical flows 


Fig. 3. The current status of the control variables for seven of the nine planetary boundaries. The 

green zone is the safe operating space (below the boundary), yellow represents the zone of uncertainty 
(increasing risk), and red is the high-risk zone. The planetary boundary itself lies at the inner heavy circle. 
The control variables have been normalized for the zone of uncertainty (between the two heavy circles): 
the center of the figure therefore does not represent values of 0 for the control variables. The control 
variable shown for climate change is atmospheric CO 2 concentration. Processes for which global-level 
boundaries cannot yet be quantified are represented by gray wedges: these are atmospheric aerosol 
loading, novel entities, and the functional role of biosphere integrity. Modified from (J). 


■ Beyond zone of uncertainty (high risk) 

□ In zone of uncertainty (increasing risk) 

■ Below boundary (safe) 

□ Boundary not yet quantified 


Functional 

diversity 


Land-system 

change 


Freshwater 



Phosphorus 


million years— about 1 per million species-years 
(3P)— and add a large uncertainty bound, raising 
the boundary to 10 per million species-years. The 
risk is that, although the Earth system can tol- 
erate a higher-than-background level of extinc- 
tions for a time, we do not know what levels of, or 
types of, biodiversity loss may possibly trigger non- 
linear or irreversible changes to the Earth system. 

The second control variable aims to capture the 
role of the biosphere in Earth-system functioning 
and measures loss of biodiversity components at 
both global and biome/large ecosystem levels. Al- 
though several variables have been developed at 
local scales for measuring functional diversity 
[e.g., {40)], finding an appropriate control varia- 
ble at regional or global levels is challenging. For 
the present, we propose an interim control var- 
iable, the Biodiversity Intactness Index (BII) {41). 
BII assesses change in population abundance as a 
result of human impacts, such as land or resource 
use, across a wide range of taxa and functional 
groups at a biome or ecosystem level using pre- 
industrial era abundance as a reference point. The 
index typically ranges from 100% (abundances 
across all functional groups at preindustrial levels) 
to lower values that reflect the extent and degree 
of human modification of populations of plants 
and animals. BII values for particular functional 
groups can go above 100% if human modifications 
to ecosystems lead to increases in the abundance 
of those species. 

Due to a lack of evidence on the relationship 
between BII and Earth-system responses, we pro- 


pose a preliminary boundary at 90% of the BII 
but with a very large uncertainty range (90 to 
30%) that reflects the large gaps in our knowl- 
edge about the BII-Earth-system functioning 
relationship {42, 43). BII has been so far applied 
to southern Africa’s terrestrial biomes only (see 
fig. S3 for an estimation of aggregated human 
pressures on the terrestrial biosphere globally), 
where the index (not yet disaggregated to func- 
tional groups) was estimated to be 84%. BII 
ranged from 69 to 91% for the seven countries 
where it has been applied {41). Observations across 
these countries suggest that decreases in BII ad- 
equately capture increasing levels of ecosystem 
degradation, defined as land uses that do not al- 
ter the land-cover type but lead to a persistent 
loss in ecosystem productivity {41). 

In addition to further work on functional mea- 
sures such as BII, in the longer term the concept 
of biome integrity— the functioning and persist- 
ence of biomes at broad scales (7)— offers a prom- 
ising approach and, with further research, could 
provide a set of operational control variables (one 
per biome) that is appropriate, robust, and scien- 
tifically based. 

Stratospheric ozone depletion 

We retain the original control variable [O3 con- 
centration in DU (Dobson units)] and boundary 
(275 DU). This boundary is only transgressed 
over Antarctica in the austral spring, when O3 
concentration drops to about 200 DU {44). How- 
ever, the minimum O3 concentration has been 


steady for about 15 years and is expected to rise 
over the coming decades as the ozone hole is 
repaired after the phasing out of ozone-depleting 
substances. This is an example in which, after a 
boundary has been transgressed regionally, hu- 
manity has taken effective action to return the 
process back to within the boundary. 

Ocean acidification 

This boundary is intimately linked with one of 
the control variables, CO2, for the climate change 
PB. The concentration of free ions in the sur- 
face ocean has increased by about 30% over the 
past 200 years due to the increase in atmospheric 
CO2 (45). This, in turn, influences carbonate chem- 
istry in surface ocean waters. Specifically, it lowers 
the saturation state of aragonite (fla^ag), a form of 
calcium carbonate formed by many marine orga- 
nisms. At flarag < aragonite will dissolve. No 
new evidence has emerged to suggest that the 
originally proposed boundary (>80% of the pre- 
industrial average annual global flarag) should 
be adjusted, although geographical heterogeneity 
in Harag IS important in monitoring the state of 
the boundary around the world’s oceans (fig. S4). 
Currently, Uarag is approximately equal to 84% of 
the preindustrial value {46). This boundary would 
not be transgressed if the climate-change bound- 
ary of 350 ppm CO2 were to be respected. 

Biogeochemicai flows 

The original boundary was formulated for phos- 
phorus (P) and nitrogen (N) only, but we now 
propose a more generic PB to encompass human 
influence on biogeochemicai flows in general. Al- 
though the carbon cycle is covered in the climate- 
change boundaiy, other elements, such as silicon 
{47, 48), are also important for Earth-system func- 
tioning. Furthermore, there is increasing evidence 
that ratios between elements in the environment 
may have impacts on biodiversity on land and in 
the sea {49-51). Thus, we may ultimately need to 
develop PBs for other elements and their ratios, 
although for now we focus on P and N only. 

A two-level approach is now proposed for the 
P component of the biogeochemicai flows bound- 
ary (see also the supplementary materials). The 
original global-level boundary, based on the pre- 
vention of a large-scale ocean anoxic event, is 
retained, with the proposed boundary set at a 
sustained flow of 11 Tg P year“^ from freshwater 
systems into the ocean. Based on the analysis of 
Carpenter and Bennett {3), we now propose an 
additional regional-level P boundary, designed 
to avert widespread eutrophication of freshwater 
systems, at a flow of 6.2 Tg P year“^ from fer- 
tilizers (mined P) to erodible soils. 

Given that the addition of P to regional 
watersheds is almost entirely from fertilizers, the 
regional-level boundary applies primarily to the 
world’s croplands. The current global rate of ap- 
plication of P in fertilizers to croplands is 14.2 Tg 
P year"^ (52, 53). Observations point toward a few 
agricultural regions of very high P application 
rates as the main contributors to the transgres- 
sion of this boundary (Fig. 2 and fig. S5A) and 
suggest that a redistribution of P from areas 
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where it is currently in excess to areas where the 
soil is naturally P-poor may simultaneously boost 
global crop production and reduce the transgres- 
sion of the regional-level P boundary {3, 52, 54). 

The N boundary has been taken from the com- 
prehensive analysis of de Vries et al. (5), which 
proposed a PB for eutrophication of aquatic eco- 
systems of 62 Tg N year“^ from industrial and 
intentional biological N fixation, using the most 
stringent water quality criterion. As for the P 
boundary, a few agricultural regions of very high 
N application rates are the main contributors to 
the transgression of this boundary (Fig. 2 and 
fig. S5B). This suggests that a redistribution of N 
could simultaneously boost global crop produc- 
tion and reduce the transgression of the regional- 
level boundary. 

Because the major anthropogenic perturba- 
tion of both the N and P cycles arises from fertil- 
izer application, we can analyze the links between 
the independently determined N and P bounda- 
ries in an integrated way based on the N:P ratio 
in the growing plant tissue of agricultural crops. 
Applying this ratio, which is on average 11.8 (55), 
to the P boundary (6.2 Tg P year“^) gives an N 
boundary of 73 Tg N year“^. Conversely, applying 
the ratio to the N boundary (62 Tg N year"^) gives 
a P boundary of 5.3 Tg P year“^. The small dif- 
ferences between the boundaries derived using 
the N:P ratio and those calculated independent- 
ly, which are likely nonsignificant differences 
given the precision of the data available for the 
calculations, show the internal consistency in 
our approach to the biogeochemical boundaries. 

More detail on the development of the P and N 
boundaries is given in (33), where we also em- 
phasize that the proposed P and N boundaries 
maybe larger for an optimal allocation of N (and 
P) over the globe. 

Land-system change 

The updated biosphere integrity boundary pro- 
vides a considerable constraint on the amount 
and pattern of land-system change in all ter- 
restrial biomes: forests, woodlands, savannas, 
grasslands, shrublands, tundra, and so on. The 
land-system change boundary is now focused 
more tightly on a specific constraint: the biogeo- 
physical processes in land systems that directly 
regulate climate— exchange of energy, water, and 
momentum between the land surface and the 
atmosphere. The control variable has been changed 
from the amount of cropland to the amount of 
forest cover remaining, as the three major forest 
biomes— tropical, temperate and boreal— play a 
stronger role in land surface-climate coupling 
than other biomes (56, 57). In particular, we fo- 
cus on those land-system changes that can in- 
fluence the climate in regions beyond the region 
where the land-system change occurred. 

Of the forest biomes, tropical forests have sub- 
stantial feedbacks to climate through changes in 
evapotranspiration when they are converted to 
nonforested systems, and changes in the distribu- 
tion of boreal forests affect the albedo of the land 
surface and hence regional energy exchange. Both 
have strong regional and global teleconnections. 


The biome-level boundary for these two types of 
forest have been set at 85% (Table 1 and the 
supplementary materials), and the boundary for 
temperate forests has been proposed at 50% of 
potential forest cover, because changes to tem- 
perate forests are estimated to have weaker in- 
fluences on the climate system at the global level 
than changes to the other two major forest 
biomes (56). These boundaries would almost 
surely be met if the proposed biosphere integ- 
rity boundary of 90% BII were respected. 

Estimates of the current status of the land- 
system change boundary are given in Figs. 2 and 
3 and fig. S6 and in (58). 

Freshwater use 

The revised freshwater use boundary has retained 
consumptive use of blue water [from rivers, lakes, 
reservoirs, and renewable groundwater stores 
(59)] as the global-level control variable and 
4000 km^/year as the value of the boundary. 
This PB may be somewhat higher or lower de- 
pending on rivers’ ecological flow requirements 
(6). Therefore, we report here a new assessment 
to complement the PB with a basin-scale bound- 
ary for the maximum rate of blue water with- 
drawal along rivers, based on the amount of water 
required in the river system to avoid regime shifts 
in the functioning of flow-dependent ecosystems. 
We base our control variable on the concept of 
environmental water flows (EWF), which defines 
the level of river flows for different hydrological 
characteristics of river basins adequate to main- 
tain a fair-to-good ecosystem state (60-62). 

The variable monthly flow (VMF) method 
(33, 63) was used to calculate the basin-scale 
boundary for water. This method takes account 
of intra-annual variability by classifying flow re- 
gimes into high-, intermediate-, and low-flow 
months and allocating EWF as a percentage of 
the mean monthly flow (MMF). Based on this 
analysis, the zones of uncertainty for the river- 
basin scale water boundary were set at 25 to 55% 
of MMF for the low-flow regime, 40 to 70% for 
the intermediate-flow regime, and 55 to 85% for 
the high-flow regime (table S2). The boundaries 
were set at the lower end of the uncertainty 
ranges that encompass average monthly EWF. 
Our new estimates of the current status of the 
water use boundary— computed based on grid 
cell-specific estimates of agricultural, industrial, 
and domestic water withdrawals— are shown in 
Figs. 2 and 3, with details in figs. S7 and S8. 

Atmospheric aerosol loading 

Aerosols have well-known, serious human health 
effects, leading to about 7.2 million deaths per 
year (64). They also affect the functioning of the 
Earth system in many ways (65) (fig. S9). Here, 
we focus on the effect of aerosols on regional 
ocean-atmosphere circulation as the rationale 
for a separate aerosols boundary. We adopt aero- 
sol optical depth (AOD) (33) as the control var- 
iable and use the south Asian monsoon as a case 
study, based on the potential of widespread aero- 
sol loading over the Indian subcontinent to switch 
the monsoon system to a drier state. 


The background AOD over south Asia is -0.15 
and can be as high as 0.4 during volcanic events 
(66). Emissions of black carbon and organic car- 
bon from cooking and heating with biofuels and 
from diesel transportation, and emission of sul- 
fates and nitrates from fossil fuel combustion, 
can increase seasonal mean AODs to as high as 
0.4 (larger during volcanic periods), leading to 
decreases of 10 to 15% of incident solar radiation 
at the surface (fig. S9). A substantial decrease in 
monsoon activity is likely around an AOD of 0.50, 
an increase of 0.35 above the background (67). 
Taking a precautionary approach toward uncer- 
tainties surrounding the position of the tipping 
point, we propose a boundary at an AOD of 0.25 
(an increase due to human activities of 0.1), with 
a zone of uncertainty of 0.25 to 0.50. The annual 
mean AOD is currently about 0.3 (66), within the 
zone of uncertainty. 

Introduction of novel entities 

We define novel entities as new substances, new 
forms of existing substances, and modified life 
forms that have the potential for unwanted geo- 
physical and/or biological effects. Anthropogenic 
introduction of novel entities to the environment 
is of concern at the global level when these en- 
tities exhibit (i) persistence, (ii) mobility across 
scales with consequent widespread distributions, 
and (iii) potential impacts on vital Earth-system 
processes or subsystems. These potentially in- 
clude chemicals and other new types of engi- 
neered materials or organisms [e.g., (68-71)'] not 
previously known to the Earth system, as well as 
naturally occurring elements (for example, heavy 
metals) mobilized by anthropogenic activities. 
The risks associated with the introduction of 
novel entities into the Earth system are exempli- 
fied by the release of CFCs (chlorofiuorocarbons), 
which are very useful synthetic chemicals that 
were thought to be harmless but had unexpected, 
dramatic impacts on the stratospheric ozone layer. 
In effect, humanity is repeatedly running such 
global-scale experiments but not yet applying the 
insights from previous experience to new appli- 
cations (72, 73). 

Today there are more than 100,000 substances 
in global commerce (74). If nanomaterials and 
plastic polymers that degrade to microplastics 
are included, the list is even longer. There is also 
a “chemical intensification” due to the rapidly 
increasing global production of chemicals, the 
expanding worldwide distribution as chemical 
products or in consumer goods, and the exten- 
sive global trade in chemical wastes (75). 

In recent years, there has been a growing de- 
bate about the global-scale effects of chemical 
pollution, leading to calls for the definition of 
criteria to identify the kinds of chemical sub- 
stances that are likely to be globally problematic 
(76, 77). Persson et al. (73) proposed that there are 
three conditions that need to be fulfilled for a 
chemical to pose a threat to the Earth system: (i) 
the chemical has an unknown disruptive effect 
on a vital Earth-system process; (ii) the disruptive 
effect is not discovered until it is a problem at the 
global scale; and (iii) the effect is not readily 
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reversible. The challenge to the research commu- 
nity is to develop the knowledge base that allows 
the screening of chemicals, before they are re- 
leased into the environment, for properties that 
may predispose them toward becoming global 
problems. 

As a first step toward meeting this challenge, 
the three conditions outlined above have been 
used as the basis for identifying scenarios of 
chemical pollution that fulfill the conditions and 
as a next step for pinpointing chemical profiles 
that fit the scenarios {28). This proposal consti- 
tutes a first attempt at adding the Earth-system 
perspective when assessing hazard and risk of 
chemicals and offers a vision for a systematic ap- 
proach to a complex management situation with 
many unknowns. 

Despite this progress in developing an Earth- 
system-oriented approach, there is not yet an 
aggregate, global-level analysis of chemical pol- 
lution on which to base a control variable or a 
boundary value. It may also serve little purpose 
to define boundary values and control varia- 
bles for a planetary boundary of this complexity. 
Nevertheless, there is a potential threat from 
novel entities to disrupt the functioning of the 
Earth-system and society needs to learn how to 
mitigate these unknown risks and manage chem- 
icals under uncertainty {28, 73). 

Some precautionary and preventive actions 
can be considered. These may include a stronger 
focus on green chemistry {78), finding synergies 
with risk-reducing interventions in other fields 
such as occupational health {79), paying more 
attention to learning from earlier mistakes {80, 
81), and investing in science to better under- 
stand and monitor vital Earth-system processes 
in order to be able to detect disruptive effects 
from novel entities as early as possible. 

Hierarchy of boundaries 

An analysis of the many interactions among the 
boundaries (table S3 and fig. SIO) suggests that 
two of them— climate change and biosphere 
integrity— are highly integrated, emergent system- 
level phenomena that are connected to all of the 
other PBs. They operate at the level of the whole 
Earth system (7) and have coevolved for nearly 
4 billion years {82). They are regulated by the 
other boundaries and, on the other hand, pro- 
vide the planetary-level overarching systems with- 
in which the other boundary processes operate. 
Furthermore, large changes in the climate or in 
biosphere integrity would likely, on their own, 
push the Earth system out of the Holocene state. 
In fact, transitions between time periods in Earth 
history have often been delineated by substantial 
shifts in climate, the biosphere, or both {82, 83). 

These observations suggest a two-level hierar- 
chy of boundaries, in which climate change and 
biosphere integrity should be recognized as core 
planetary boundaries through which the other 
boundaries operate. The crossing of one or more 
of the other boundaries may seriously affect hu- 
man well-being and may predispose the trans- 
gression of a core boundary(ies) but does not by 
itself lead to a new state of the Earth system. This 


hierarchical approach to classifying the bounda- 
ries becomes clearer by examining in more detail 
the roles of climate and biosphere integrity in the 
functioning of the Earth system. 

The climate system is a manifestation of the 
amount, distribution, and net balance of energy 
at Earth’s surface. The total amount of energy 
sets the overall conditions for life. In Earth’s cur- 
rent climate, a range of global surface temper- 
atures and atmospheric pressures allows the three 
phases of water to be present simultaneously, 
with ice and water vapor playing critical roles in 
the physical feedbacks of the climate system. The 
distribution of energy by latitude, over the land 
and sea surfaces, and within the ocean plays a 
major role in the circulation of the two great 
fluids, the ocean and the atmosphere. These sys- 
temic physical characteristics are key spatial de- 
terminants of the distribution of the biota and 
the structure and functioning of ecosystems and 
are controllers of biogeochemical flows. 

Biosphere integrity is also crucial to Earth- 
system functioning, where the biosphere is de- 
fined as the totality of all ecosystems (terrestrial, 
freshwater, and marine) on Earth and their biota 
{32). These ecosystems and biota play a critical 
role in determining the state of the Earth system, 
regulating its material and energy flows and its 
responses to abrupt and gradual change (7). Di- 
versity in the biosphere provides resilience to 
terrestrial and marine ecosystems {83, 84). The 
biosphere not only interacts with the other plan- 
etary boundaries but also increases the capacity 
of the Earth system to persist in a given state under 
changes in these other boundaries. The ultimate 
basis for the many roles that the biosphere plays 
in Earth-system dynamics is the genetic code of 
the biota, the basic information bank that de- 
fines the biosphere’s functional role and its ca- 
pacity to innovate and persist into the future. 

Planetary boundaries in a 
societal context 

A proposed approach for sustainable develop- 
ment goals (SDGs) (85) argues that the stable 
functioning of the Earth system is a prereq- 
uisite for thriving societies around the world. This 
approach implies that the PB framework, or 
something like it, will need to be implemented 
alongside the achievement of targets aimed at 
more immediate human needs, such as provi- 
sion of clean, affordable, and accessible energy 
and the adequate supply of food. World devel- 
opment within the biophysical limits of a stable 
Earth system has always been a necessity [e.g., 
{86, 87)]. However, only recently, for a number 
of reasons, has it become possible to identify, 
evaluate, and quantify risks of abrupt planetary- 
and biome-level shifts due to overshoot of key 
Earth-system parameters: (i) the emergence of 
global-change thinking and Earth-system think- 
ing (88); (ii) the rise of ‘the Planetary” as a rel- 
evant level of complex system understanding 
{89-92); and (iii) observable effects of the rapid 
increase in human pressures on the planet {16). 

The PB approach is embedded in this emerg- 
ing social context, but it does not suggest how to 


maneuver within the safe operating space in the 
quest for global sustainability. For example, the 
PB framework does not as yet account for the re- 
gional distribution of the impact or its histor- 
ical patterns. Nor does the PB framework take 
into account the deeper issues of equity and cau- 
sation. The current levels of the boundary pro- 
cesses, and the transgressions of boundaries that 
have already occurred, are unevenly caused by 
different human societies and different social 
groups. The wealth benefits that these trans- 
gressions have brought are also unevenly distrib- 
uted socially and geographically. It is easy to 
foresee that uneven distribution of causation and 
benefits will continue, and these differentials 
must surely be addressed for a Holocene-like 
Earth-system state to be successfully legitimated 
and maintained. However, the PB framework as 
currently construed provides no guidance as to 
how this may be achieved [although some po- 
tential synergies have been noted (54)], and it 
cannot readily be used to make choices between 
pathways for piecemeal maneuvering within 
the safe operating space or more radical shifts of 
global governance {93). 

The nature of the PB framework implies that 
two important cautions should be observed when 
application of the framework to policy or man- 
agement is proposed: boundary interactions and 
scale. 

Boundary interactions 

The planetary boundaries framework arises from 
the scientific evidence that Earth is a single, 
complex, integrated system— that is, the bound- 
aries operate as an interdependent set [e.g., 
(P4)] (table SI and fig. SIO). Although a system- 
atic, quantitative analysis of interactions among 
all of the processes for which boundaries are 
proposed remains beyond the scope of current 
modeling and observational capacity, the Earth 
system clearly operates in well-defined states in 
which these processes and their interactions 
can create stabilizing or destabilizing feedbacks 
{16, 90, 96). This has profound implications for 
global sustainability, because it emphasizes the 
need to address multiple interacting environ- 
mental processes simultaneously (e.g., stabilizing 
the climate system requires sustainable forest 
management and stable ocean ecosystems). 

Scaie 

The PB framework is not designed to be “down- 
scaled” or “disaggregated” to smaller levels, such 
as nations or local communities. That said, the 
PB framework recognizes the importance of 
changes at the level of subsystems in the Earth 
system (e.g., biomes or large river basins) on the 
functioning of the Earth system as a whole. Also, 
there are strong arguments for an integrated ap- 
proach coupling boundary definitions at region- 
al and global levels with development goals to 
enable the application of “PB thinking” at lev- 
els (nations, basins, and regions) where policy 
action most commonly occurs [e.g., (85, 96)]. 

This update of the PB framework is one step on 
a longer-term evolution of scientific knowledge to 
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inform and support global sustainability goals 
and pathways. This evolution is needed more 
than ever before; there are severe implementa- 
tion gaps in many global environmental policies 
relating to the PB issues, where problematic 
trends are not being halted or reversed despite 
international consensus about the urgency of the 
problems. The prospect of tighter resource con- 
straints and rising environmental hazards is also 
unavoidably turning the focus onto global social 
equity and the planetary stewardship of Earth’s 
life-support system. There is a need for a truly 
global evidence base, with much greater integra- 
tion among issues, in order to respond to these 
global challenges. New research initiatives [e.g.. 
Future Earth (www.futureearth.org)] provide evi- 
dence that science can respond to this need by 
applying Earth-system research to advance a new 
generation of integrated global analyses and to 
explore options for transformations toward sus- 
tainability. This is a clear sign that, as the risks 
of the Anthropocene to human well-being be- 
come clearer, research is maturing to a point 
where a systemic step-change is possible— and 
necessary— in exploring and defining a safe and 
just planetary operating space for the further 
development of human societies. 

Methods summary 

Our approach to building the planetary bound- 
aries framework is described above. We have 
implemented the framework through an ex- 
pert assessment and synthesis of the scientific 
knowledge of intrinsic biophysical processes that 
regulate the stability of the Earth system. Our 
precautionary approach is based on the main- 
tenance of a Holocene-like state of the Earth 
system and on an assessment of the level of 
human-driven change that would risk destabi- 
lizing this state. For the climate change PB, there 
is already much literature on which to base 
such an assessment. For others, such as strato- 
spheric ozone, ocean acidification, extinction 
rates, and P and N cycles, we have used estimates 
of preindustrial values of the control variable 
as a Holocene baseline. Where large, undesira- 
ble thresholds exist and have been studied (e.g., 
polar ice sheets, Amazon rainforest, aragonite 
dissolution, atmospheric aerosols, and the south 
Asian monsoon), quantitative boundaries can be 
readily proposed. For others, where the focus is 
on erosion of Earth-system resilience, the bound- 
aries are more difficult (but not impossible) to 
quantify, as reflected in larger uncertainty zones. 

We used large-scale assessments of the impacts 
of human activities on Earth-system functioning 
[e.g.. Intergovernmental Panel on Climate Change 
{17, 18), the International Geosphere-Biosphere 
Programme synthesis {16), and chemicals (75, 80)'] 
as sources of community-level understanding 
on which to propose PBs. Our update has also 
relied on post-2009 assessments of individual 
boundaries by the relevant expert research com- 
munities; examples include phosphorus {3), ni- 
trogen (5), biosphere integrity (7), freshwater use 
(5, 63), and novel entities [with a focus on chem- 
icals {28, 73)]. Finally, some new analyses have 


been undertaken specifically for this paper: (i) a 
freshwater-use PB based on the EWE approach 
{33, 63)', (ii) the linkage of the phosphorus and 
nitrogen boundaries via the N:P ratio in grow- 
ing crop tissue {33)', and (hi) the use of major 
forest biomes as the basis for the land-system 
change PB {33). 
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A data-intensive approach to 
mechanistic elucidation applied 
to chiral anion catalysis 

Anat Milo/* Andrew J. Neel,^* F. Dean Toste,^t Matthew S. Sigman^f 

Knowledge of chemical reaction mechanisms can facilitate catalyst optimization, but 
extracting that knowledge from a complex system is often challenging. Here, we present 
a data-intensive method for deriving and then predictively applying a mechanistic model 
of an enantioselective organic reaction. As a validating case study, we selected an 
intramolecular dehydrogenative C-N coupling reaction, catalyzed by chiral phosphoric 
acid derivatives, in which catalyst-substrate association involves weak, noncovalent 
interactions. Little was previously understood regarding the structural origin of 
enantioselectivity in this system. Catalyst and substrate substituent effects were probed 
by means of systematic physical organic trend analysis. Plausible interactions between 
the substrate and catalyst that govern enantioselectivity were identified and supported 
experimentally, indicating that such an approach can afford an efficient means of 
leveraging mechanistic insight so as to optimize catalyst design. 


C atalyst discovery and development often 
rely on empirical observations gained 
through the laborious evaluation of mul- 
tiple potential reaction variables (i). Although 
high-throughput methods can streamline 
this process (1-9), the ability to rationally design 
catalysts that affect chemical reactions in a pre- 
dictable manner would be a transformative step 
forward. In catalysis, the principal challenge lies 
in inferring how catalyst structural features affect 
the mechanistic aspects of a given chemical re- 
action, including those that govern selectivity 
when multiple products are possible (10-12). Al- 
though mechanistic studies are able to guide the 
rational design of catalytic systems, traditional 
approaches are not often suited to address the 
complexity of modem catalytic transformations. 
This limitation is especially apparent in cases in 
which selectivity is affected by subtle catalyst and 
substrate structural features (13), and/or the 
product-determining step of the reaction occurs 
after the rate-determining step. In order to address 
such systems, we envisioned a strategy for mecha- 
nistic study involving the application of modem 
data analysis techniques. This approach relies on 
the generation of mathematical correlations bet- 
ween quantifiable properties describing the inter- 
acting reaction partners’ molecular structures 
(molecular descriptors) and a measurable outcome 
of the reaction {for example, enantioselectivity. 
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which is represented as the energy difference 
between transition states leading to either enan- 
tiomer AAG* * = -RTlnl(S)/(R)^, kilocalories/mole} 
(14). Combining appropriate experimental design, 
data organization, and trend analysis techniques 
provides the basis to distinguish causal relations, 
producing testable hypotheses regarding the stmc- 
tural origin of the reaction outcome. New catalysts 
can be designed, and the ability of the models to 
predict new experimental outcomes can be used 
as validation of the mechanistic hypotheses. Here, 
we demonstrate that this approach enables in- 
depth mechanistic analysis of interactions that 
govern enantioselectivity, affording nonintuitive 
insight into the origin of asymmetric induction 
and guiding rational catalyst design. 

Choice of case study 

To assess the applicability of this data-driven 
approach toward mechanistic analysis, a system 
was sought with a poorly understood mecha- 
nism that would be difficult to probe by using 
standard techniques. Accordingly, the field of 
chiral anion catalysis was appealing because 
diverse catalyst-substrate interactions contrib- 
ute to enantioselectivity, but their distinctive 
effects are difficult to deconvolute. Particularly, 
the oxoammonium salt (3)-mediated enantio- 
selective cross dehydrogenative coupling (CDC) cat- 
alyzed by chiral l,l-Bi-2-naphthol (BINOL)-based 
phosphoric acids (PAs) (Fig. 1, 1) bearing triazoles 
at the 3 and 3' positions reported in 2013 by Toste 
and coworkers (75) was identified as a prototyp- 
ical example. This type of reaction could benefit 
from such an analysis because of the following 
challenges. First, high levels of enantioselectivity 
were necessarily predicated on the oxidant’s 
(16-19) insolubility under the reaction condi- 


tions, precluding rigorous kinetic analyses. Sec- 
ond, the enantioselectivity trends with respect 
to both catalyst and substrate were not obvious, 
with even modest structural modifications result- 
ing in substantial differences (Fig. 1C). Last, we 
hypothesized that enantioselectivity was governed 
by attractive noncovalent interactions (75). These 
subtle interactions are ubiquitous in biological 
and catalytic systems (73) but are difficult to 
study or apply toward rational catalyst design, 
especially if several such interactions are involved 
in determining reaction outcomes. 

The distinct mechanistic features of triazole- 
PA catalysts are highlighted by the observation 
that they result in opposite and enhanced en- 
antioselectivities (Fig. lA) relative to more con- 
ventional PAs such as Cg-TRIP ( 5 ) and TCYP ( 6 ), 
which are representative of BINOL-based PAs 
that have seen the most extensive use (20-22). 
Additionally, electronically distinct pyrazolyl 
(pyr-le) and imidazolyl (imid-le) PAs afford 
products with significantly reduced enantio- 
selectivities relative to the parent triazolyl (Fig. lA, 
le), despite having nearly identical steric pro- 
files. Although these data allude to selectivity 
determination via attractive, noncovalent inter- 
actions between the catalyst and substrate, such 
interactions are difficult to further characterize. 
This limitation is not uncommon in enantio- 
selective catalysis. Thus, our goal was to develop 
a general, data-driven technique for the evalu- 
ation of how subtle structural features affect se- 
lectivity, using this reaction as a challenging case 
study. 

Kinetic isotope effects 

Before any mechanistic study focused on the 
origins of selectivity, we sought to establish the 
enantioselectivity-imparting step (or steps) in 
the catalytic cycle. Bearing in mind the aim of 
mathematically relating structural features of 
the reacting components to enantioselectivity, 
this knowledge would reveal the elementary step 
that is being represented by the catalyst and 
substrate molecular descriptors (vide infra). With 
respect to the general mechanism in Fig. IB, we 
sought to distinguish two scenarios: (i) Enan- 
tioselectivity is determined during oxidation of 
substrate 2 , or (ii) enantioselectivity is determined 
during the cyclization of an oxidized intermediate 
(Fig. 2A, A). With respect to the former scenario, 
it was conceivable that although the stereogenic 
center is formally set in the cyclization from the 
oxidized intermediate (A), the interactions bet- 
ween the substrate and catalyst during the oxi- 
dation event may preorganize the system for 
effective enantioselection. 

To distinguish between these possibilities, a 
set of kinetic isotope (KIE) experiments was per- 
formed by using [90% D incorporation, 

74:26 enantiomeric ratio (er)]. We expected that 
if the chiral phosphate were involved in substrate 
oxidation, different KIEs would be observed for 
the formation of Askrdi when using enantiomeric 
catalysts. Indeed, (R)- and (S)-le promoted the 
reaction with KIEs of 3.42 and 1.08 respectively, 
suggesting the involvement of the catalyst in the 
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Fig. 1. System understudy: asymmetric C-N dehydrogenative coupling. (A) BINOL-based phosphoric acid scaffold, enantioselective cross dehydrogenative 
coupling reaction scheme, and nitrogen-deletion experiment. (B) Proposed mechanism involving a chiral phosphate-substrate ion pair. (C) Enantiomeric excess 
(ee) values obtained by using various substrate/catalyst combinations. (D) Library design and parameter identification strategy. 
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Fig. 2. Kinetic isotope effect studies and mechanistic implications. (A) Considerations regarding the origin of enantioselectivity. (B) KIEs of deuterated 
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Fig. 3. Impact of heterocyclic catalyst substituent on enantioselectivity. (A) Predicted and measured 
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Predicted versus measured AAG^ plot. (D) Illustration of the structural features implicated by the 
identified parameters. 


oxidation (Fig. 2B). If enantioselectivity were also 
established in this step, different enantiomeric 
excess (ee) values would be expected when using 
the enantiomeric catalysts. The observation that 
the final products exhibited equal but opposite 
levels of enantioselectivity is consistent with 
enantioselection occurring during the bond- 
forming event from an oxidized intermediate 
such as A. This result indicated that the two key 
steps of this reaction (oxidation and cyclization) 
proceed under independent Curtin-Hammett 
control (10, 11), with similar interactions presum- 
ably governing selectivity in both cases (Fig. 2C). 
The KIE values resulting from the less-selective 
catalysts pyr-le and 6 (Figs. lA and 2B) do not 
display the same enantiomer-dependency as those 
from le. This effect is consistent with the assertion 
that similar catalyst-substrate interactions are in- 
volved throughout the mechanism, as well as the 
triazole substituent’s superior ability to interact 
with substrate 2. However, the specific nature 
of this interaction remained undefined. To this 
end, a thorough analysis of an extensive data set 
containing structural perturbations to the cata- 
lyst and substrate could serve to illuminate these 
enantioselectivity-directing interactions. 

Experimental design and analysis 

The collection and organization of diverse data 
sets is at the foundation of data-driven analysis 
strategies (23). Therefore, an effective experimen- 
tal library should include rational changes to 
various structural features that affect the reac- 
tion outcome of interest. To this end, substrates 
( 2 ) were modified at positions hypothesized to 
influence enantioselectivity (at the 2-, 4-, and 6- 
positions of the benzyl ring and the distal aryl 
ring) (Fig. ID), using substituents with varied 
electronic and steric properties (according to 
their Hammett Cpara and Sterimol Bj values, re- 
spectively; additional details are provided in 
the supplementary materials, p4-8). Similarly, 
catalysts (1) were modified at the 2-, 4-, and 6- 
positions of the aryl ring attached to the triazole. 
Adamantyl-substituted catalysts le, pyr-le, and 
imid-le were also included to explore the effect 
of changes to the heterocyclic ring. In total, 12 
substrates and 11 triazolyl catalysts were selected 
(Fig. ID). These libraries were then synthesized, 
and the enantioselectivity of each catalyst-substrate 
combination was obtained. Simultaneously, a 
diverse array of molecular descriptor values 
was collected to describe the structural features 
of each catalyst and substrate, including Sterimol 
parameters (24), length measurements from 
geometry optimized structures, and computa- 
tionally derived vibrational frequencies and 
intensities (details are provided in the supple- 
mentary materials, p4-8) (Fig. ID) (25). Linear 
regression algorithms were then applied to var- 
ious subsets of the data to identify correlations 
between molecular structure and the experi- 
mentally determined enantioselectivity. Subse- 
quently, analysis and refinement of the resulting 
models were used to produce explicit mechanis- 
tic hypotheses that were then tested and val- 
idated experimentally. 


Modeling catalyst heterocyclic rings 

Given the clear importance of the catalyst het- 
erocyclic ring in enantioselectivity determina- 
tion (vide supra), we initially sought to understand 
the subset of results obtained by using catalysts 
le, pyr-le, and imid-le. Accordingly, by using 
linear regression techniques the correlation de- 
picted in Fig. 3B was identified from a training 
set of 10 different substrate-catalyst combina- 
tions (Fig. 3, A and C, black squares). Of the 
large number of steric (26) and vibrational (25) 
terms initially investigated as molecular descrip- 
tors, four discrete vibrational parameters were 
sufficient to produce a correlation with enantio- 
selectivity: one catalyst descriptor (vY-N, a stretch- 
ing frequency on the heterocyclic ring), and two 
substrate descriptors (the stretching frequency 
of the amide C=0 bond, vC=0, and stretching 
frequency/intensity of the C-H bond undergoing 
oxidation, v/iC-H) (Fig. 3, B and D). A cross- 
term between the catalyst and substrate descrip- 
tors improves the overall quality of the model 
(vY-N X iC-H), suggesting a synergistic struc- 
tural effect. The model was evaluated by plott- 
ing measured versus predicted AAG* values 
(Fig. 3C), and as a validation of its robustness, 
the enantioselectivities of 10 catalyst-substrate 
combinations not included in the training set 
were well-predicted (Fig. 3, A and C, red crosses). 
A slope approaching unity and intercept approach- 
ing zero over the training and validation sets 
indicate an accurate and predictive model, and 
the R^ value of 0.90 demonstrates a high degree 
of precision. The largest coefficient in this nor- 
malized model belongs to the heterocyclic ring 
vibrational frequency, signifying its substantial 
role in the quantification of enantioselectivity. 


This model is capable of predicting results 
whose origins are not obvious upon inspection. 
For example, comparison of the reaction out- 
comes using le and pyr-le with substrate 2a 
(Fig. 3, entries 1 versus 13) may lead to the con- 
clusion thatp^r-le generally affords inferior se- 
lectivity. Indeed, experimental results for several 
additional substrates support this notion and are 
accurately predicted by the model (for example, 
2-OMe benzyl substrate 2e, entries 5 versus 16). 
However, with substrate 2i (R^ = 2-OMe, R 2 = Ph, 
entries 9 versus 17), the triazolyl and pyrazolyl 
PAs afford the product with similar levels of 
enantioselectivity. This counterintuitive result is 
precisely predicted, indicating that the divergent 
enantioselectivity displayed by le, as compared 
withjjgfr-le and imid-le, is adequately addressed 
by the model. 

Trend analysis 

Although the model in Fig. 3B establishes the 
capacity of the chosen parameters to describe 
subtle aspects of this system, the ultimate goal 
of this approach was to discern underlying mech- 
anistic phenomena. This objective could not be 
achieved by using merely the above correlation 
because it was produced by using a small sub- 
set of data, in which the catalyst heterocyclic 
rings bore the same substituent (adamantyl). 
We hypothesized that the complete data set con- 
tained invaluable information to this end be- 
cause it was produced by using strategically 
modified catalysts and substrates with substit- 
uents intentionally introduced to probe subtle 
effects, resulting in 132 enantioselectivities bet- 
ween -54 and 94% ee, which corresponds to a 
AAG"^ range of 2.8 kcal/mol. In accordance with 
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a data-intensive strategy, none of these mea- 
sured values was discarded because even low 
enantioselectivity carries information regarding 
the catalyst-substrate interactions at the origin of 
asymmetric induction. 

Yet before producing a global, predictive mod- 
el, we considered that a series of focused corre- 
lations, coupled with an evaluation of overall 
trends, might serve to reveal fundamental fea- 
tures of the system. To this end, for each individ- 
ual substrate a correlation was produced relating 
its observed enantioselectivity values for the en- 
tire set of catalysts, with parameters describing 
the catalyst structure (2a-21, 12 models in total) 
(fig. S3). The same strategy was applied to all 
aryl-substituted triazole catalysts by using param- 
eters describing the substrate structure (Ib-ld, 
If-Ik, 9 models total) (table S7). This organiza- 
tional scheme was viewed as a means to facilitate 
the identification of catalyst features that affect 
particular substrate types (and vice versa). Sub- 
strates or catalysts with similar structures behave 
analogously not only in a qualitative manner, but 
also in terms of the molecular descriptors that 
effectively serve to predict their enantioselectiv- 
ities (individual substrate and catalyst measured 
versus predicted AAG* plots and equations are 
available in figs. S2, S3, and S4 and table S7). 
These quantitative correlations, together with sys- 
tematically organized trends of experimental 
outcomes, can guide the development of testable 
mechanistic hypotheses. 

To simultaneously inspect multiple aspects of 
large and intricate data sets, a communicative 


visualization of data is crucial {27). Thus, we 
elected to present information gained from these 
focused mathematical models, alongside multiple 
observed enantioselectivity results, organized ac- 
cording to trends in catalyst or substrate struc- 
tural features. Demonstrating this visualization 
technique, the enantioselectivity trend for each 
catalyst (in Fig. 4, each line represents a catalyst) 
was plotted as a function of the substrates (in 
Fig. 4, each x axis tick-line represents a substrate), 
and vice versa (Fig. 5). To afford a quantitative 
trend analysis, the plots were arranged according 
to which positions were modified on the catalyst 
or substrate structures, and the corresponding 
or R 2 substituent’s Sterimol value (additional 
visualizations are provided in figs. S5 and S6) 
{24). For example, in the purple frame in Fig. 4, 2- 
substituted benzyl substrates are displayed from 
the largest to smallest substituent. The catalyst 
molecular descriptors required for correlating 
each subset of substrates are also depicted (Fig. 4), 
along with the substrate descriptors for each sub- 
set of catalysts (Fig. 5). For example, the catalyst 
molecular descriptors used as parameters for the 
correlation of enantioselectivity obtained for 2- 
substituted benzyl substrates are presented below 
the blue frame in Fig. 4 (vN-N, ^tor)- 
Analysis of this systematic data arrangement 
reveals that in general, catalyst performance cor- 
relates with the aryl substitution pattern in the 
order 2,4,6 > 2,6 > 4 (Fig. 5, gray, orange, and 
blue frames, respectively). Additionally, by juxta- 
posing Figs. 4 and 5, it appears that the reaction 
is mainly under catalyst control because catalyst 


features affect enantioselectivity in a more con- 
siderable and systematic manner (Fig. 5), whereas 
for each substrate, the spread of observed enan- 
tioselectivity is broader (Fig. 4). All substrates 
bearing a 2-substituted benzyl group— even those 
with substitution at R 2 (Fig. 4, purple and blue 
frames, respectively)— can be modeled by using 
the torsion angle between the triazole and its 
substituent, and a triazole vibration frequency 
(^torj vN-N) (individual models are provided in 
table S7). The torsion angle represents a steric 
effect yet also contains information concerning 
the conjugation between the triazole ring and 
its substituent. The vibration frequency can 
serve as a correction to both of these effects 
because it takes into account nonadditive fea- 
tures of the substituents’ charge and mass dis- 
tribution (25). 

The models for substrates with 4-benzyl sub- 
stitution (Fig. 4, green frame) contain the same 
two terms (^tor? vN-N) and an additional steric 
descriptor (the catalyst aryl ring minimal width, 
Bi). Similar interactions with the triazole ring 
should still be present for these 4-benzyl sub- 
strates, but the presence of a Bi term suggests an 
additional steric interaction between the substrate 
and catalyst substituents, which is avoided in the 
case of hydrogen at the 4-benzyl position. This 
claim is supported by the lower enantioselectiv- 
ities observed for substrates with larger 4-benzyl 
substituents, especially when using catalysts with 
larger 2,6-groups. Thus, the lack of the catalyst 
Bi term in the models describing 2- relative to 
4-benzyl substrates, and their overall higher 
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Fig. 4. Graphical representation of catalyst structure-selectivity trends as a function of substrate. Enantioselectivity trends for every catalyst against all 
substrates. Each trend line represents a catalyst, and each x axis tick-line represents a substrate. 


740 13 FEBRUARY 2015 • VOL 347 ISSUE 6223 


sciencemag.org SCIENCE 


RESEARCH | RESEARCH ARTICLE 


enantioselectivities, are thought to arise from a 
better accommodation of the former substrates’ 
shapes in the catalyst active site. 

The individual catalyst trends and models car- 
ry complementary information, revealing the sub- 
strate descriptors relevant to each catalyst subset 
(Fig. 5). The parameters that correlate with 4- 
substituted aryl catalysts’ enantioselectivity (Fig. 5, 
blue frame) include the substrate carbonyl-stretching 
vibrational frequency and intensity (v/iC=0), the 
amide N-H vibrational frequency (vN-H), and a 
cross-term between the two (vC=0 x vN-H). 
These values vary in response to substitution on 
the substrate benzyl ring or the distal aryl ring, 
with the former having a greater effect (table S2). 
These same parameters effectively correlate with 
the enantioselectivity observed for 2,6-disubstituted 
aryl catalysts (Fig. 5, orange frame), along with 
an additional vibrational term (vBn) describing a 
benzyl ring stretch. The enantioselectivity spread 
of catalysts with larger substituents at the 2,4,6- 
position [for example, 38 to 93% ee for 2,4,6- 
(Cy) 3 -Ph catalyst Id (Fig. 5, gray frame)] can be 
described by using two terms associated with the 
benzyl ring (vBn, iN-H), stressing this ring’s role 
in determining enantioselectivity. 

Trend interpretation 

Collectively, these results suggest that a tt in- 
teraction is established between the triazole ring 
and substrate during the enantioselectivity deter- 
mining step, the strength of which is modulated 
by local steric and electronic structural features 
of both interacting partners (13, 28-34). Fur- 
thermore, n interactions are often strengthened 


by heteroatoms (35-37), which could explain the 
reduced enantioselectivity values obtained by 
using imid-le, pyr-le, and TCYP catalysts com- 
pared with le (Fig. lA), as well as the similar 
KIE values obtained when using both the (R)- 
and (S')-enantiomers of pyr-le and TCYP as 
catalysts, compared with the divergent ones dis- 
played by (R)- and (S)-le (Fig. 2B). In relation to 
the substrate, participation of both the benzyl 
group and the distal aryl group in putative n 
interactions are supported by the presence of 
molecular descriptors that are sensitive to sub- 
stitution on these rings in every catalyst mod- 
el (vC=0, iC=0, vN-H, and vBn) (Fig. 5 and 
table S2). 

The energy stabilization gained from n inter- 
actions is affected by the distance and geometry 
of the rings involved (13, 30-37). If a tt interac- 
tion between the substrate and triazole is at the 
origin of enantioselectivity determination, the 
directionality of the triazole— represented by 
the torsion angle between the triazole and its 
substituent— is expected to directly affect enan- 
tioselectivity. In agreement with this hypothe- 
sis, catalysts with more pronounced torsional 
effects lead to higher enantioselectivity values 
for substrates with relatively small substituents 
at the benzyl 4-position. The torsion angle ap- 
proaches perpendicularity (90°) owing to larger 
2,6-substituents on the catalyst aryl ring con- 
nected to the triazole (Fig. 4, purple and blue 
frames, blue lines). Moreover, large catalyst 2,6- 
aryl substituents are presumed to serve as a 
steric barrier, docking the substrate in place for 
an improved overlap with the catalyst triazole 


ring. Correspondingly, substrates with elongated 
4-substituents (R = Me, OMe) lead to lower en- 
antioselectivities by using catalysts with large 
substituents at the 2,6-position (Fig. 4, green 
frame, blue lines). For these substrates, the steric 
repulsion exerted by the large 2,6-substituents 
affords a weaker or less directing n interaction 
and, subsequently, lower enantioselectivity. Thus, 
the importance of the torsion angle and vibration 
parameters for correlating enantioselectivity in 
the individual models and overall trends (Fig. 4, 
fig. S2, and table S7) is proposed to reflect the 
angle at which the triazole engages the substrate 
and the steric role of the catalyst aryl group. 
Lending further credence to this proposal, cat- 
alysts with reduced torsion angles (such as cat- 
alysts with triazole R substituents: Ph, 4-N02Ph, 
4-OMePh, or 4-S02MePh) that do not introduce 
the proposed directional and steric effects lead to 
diminished enantioselectivities overall (Fig. 5). 

Comprehensive model and probes 
of mechanistic hypotheses 

On the basis of these hypotheses, we set out to 
design a series of new catalysts to specifically 
probe putative interactions. To facilitate catalyst 
development, a predictive model (Fig. 6) was 
generated for the entire substrate set with the 
aryl-substituted catalysts (Ib-ld and If-Ik). This 
model contains 108 combinations (9 catalysts 
times 12 substrates) from the initial library of 
experiments, where half were used as a training 
set (Fig. 6B, black squares) and the other half as 
external validations (Fig. 6B, red crosses). New 
catalysts were proposed to address hypotheses 



Fig. 5. Graphical representation of substrate structure-selectivity trends as a function of catalyst. Enantioselectivity trends for every substrate against 
all catalysts. Each trend line represents a substrate, and each x axis tick-line represents a catalyst. 
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raised by the focused models and trend analysis, 
and their enantioselectivity was predicted before 
synthesis by the comprehensive model (Fig. 6). 

First, to probe whether the aryl substituent on 
the triazole ring plays a primarily steric role, 
rather than directly engaging the substrate in a tt 
interaction, perfluorophenyl catalyst 11 was de- 
signed and evaluated. Substituent local sterics and 
charge distribution have been shown to strongly 
affect noncovalent n interactions (28-30, 32-39). 
Therefore, we expected that if enantioselectivity 
were predominantly dependent on the aryl sub- 
stituent directly engaging as a partner in a tt in- 
teraction (as opposed to taking an ancillary role in 
n interactions involving the triazole), perfluoro- 
phenyl catalyst 11 would deviate significantly from 
its Ad (le), 2,6-(F)2-Ph (Ik) or 2,6-(MeO)2-Ph (Ij) 
counterparts. However, all four catalysts behave 
similarly with respect to the magnitude and sign 
of enantioselectivity (Fig. 6C, entries 1 to 9 and 24 
to 26). This result is well predicted by the model 
(Fig. 6B) and is consistent with the hypothesis 
that the main function of the aryl substituent is 
steric. 

Next, catalyst Im, bearing a single isopropyl 
group at the 2-position of the triazole aryl sub- 
stituent, was prepared to probe the hypothesis 
that steric repulsion exists between larger cata- 
lyst 2,6-substituents and elongated substrate 4- 
substituents. We anticipated that an isopropyl 
group would induce the torsion necessary to 
enforce the proposed benzyl-triazole n interac- 


tion, while avoiding a direct steric interaction 
between the substrate benzyl 4-position and the 
catalyst aryl 2,6-substituents. Indeed, for all 4- 
substituted substrates tested (Fig. 6C, entries 10 
to 12), Im provided the corresponding product in 
higher enantioselectivity than that of Ic, which 
bears isopropyl groups at both ortho positions of 
the triazole aiyl substituent (Fig. 6C, entries 13 to 
15). For 4-N02-Bn substrate 2d, the 2-iPr catalyst 
Im resulted in the highest enantioselectivity ob- 
served to date (Fig. 6, B and C, entry 12), as 
predicted by the model. 

Last, in order to evaluate the capacity to obtain 
improved enantioselectivity as a result of a data- 
intensive approach, and the hypothesis that tor- 
sion leads to enhanced enantioselectivity for the 
2-substituted substrates, several proposed cata- 
lysts were evaluated by using the model in Fig. 6. 
Catalyst In was selected because it is syntheti- 
cally feasible, accommodates a torsion angle close 
to 90° and was predicted to give improved enan- 
tioselectivity for all substrates bearing hydrogen 
at the 4-benzyl position. This prediction was ver- 
ified in practice for all eight substrates evaluated, 
affording the highest enantioselectivities observed 
to date (Fig. 6C, entries 16 to 23). These results 
confirm that a perpendicular geometry of the 
triazole and the aryl ring can indeed lead to 
higher enantioselectivities, supporting the prem- 
ise that the orientation of the triazole ring cou- 
pled with its R group’s steric constraints control 
triazole k interactions. 


The overall analysis of the triazole-PA case 
study demonstrates the complementary manner 
in which classical physical organic techniques 
and modem data analysis strategies can be merged 
toward a more complete mechanistic assessment 
(40). This approach is based on the use of em- 
pirical data, which is often a prerequisite for a 
rational reaction optimization process, to con- 
comitantly conduct a mechanistic investigation. 
Information of this sort that could be used for an 
in-depth analysis is often omitted from reports in 
the field of catalysis because only results leading 
to the desired outcomes are generally presented 
and pursued. Yet because high-throughput (2, 4, 8), 
automated (7) methods for reaction development 
are now common, data analysis strategies could 
be applied in parallel to optimization procedures, 
allowing for simultaneous mechanistic and stmc- 
tural analysis. 

Creatively collecting and organizing data to 
examine proposed hypotheses affords improved 
generalizations, particularly as data sets become 
larger and more complex (23). This idea holds 
true for the analysis of reaction trends by param- 
eters that reflect structural modification. Indeed, 
the focused catalyst and substrate models— and 
their organization according to fundamental, 
quantitative, physical-organic trends— provided 
nonintuitive insights regarding interactions in- 
volved in enantioselectivity determination. Although 
this approach is general for the prediction and 
study of chemical reaction outcomes, this case 
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Fig. 6. Validation of mechanistic hypotheses through directed catalyst design. (A) Normalized equation for the prediction of enantioselectivity by using 
parameters that describe catalyst and substrate structural features. (B) Predicted versus measured AAG^ plot. (C) Hypothesis-driven external predictions and 
their comparison with other relevant catalysts. 
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Study was chosen as a stringent benchmark be- 
cause it uses weak, noncovalent interactions for 
asymmetric induction. These interactions are typ- 
ically in the energy range required to distinguish a 
highly enantioselective reaction from its racemate 
forming counterpart (2 to 3 kcal/mol) {12, 13, 32, 37), 
providing seemingly endless approaches to rational 
catalyst design. However, controlling noncova- 
lent interactions represents a notable challenge 
in the design of catalytic systems because of mul- 
tiple energetically accessible orientations {13). Com- 
plemented with rigorous experimental analysis, 
the disclosed data-intensive approach is suited to 
addressing such intricacies and holds potential 
for the analysis of increasingly complicated cat- 
alytic systems streamlining both reaction and 
catalyst development. 
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SUPERCONDUCTIVITY 

Light-induced superconductivity 
using a photoactive electric 
double layer 

Masayuki Suda/’^* Reizo Kato,^ Hiroshi M. Yamamoto^’^* 

Electric double layers (EDLs) of ionic liquids have been used in superconducting 
field-effect transistors as nanogap capacitors. Because of the freezing of the ionic 
motion below -200 kelvin, modulations of the carrier density have been limited to the 
high-temperature regime. Here we observe carrier-doping-induced superconductivity in an 
organic Mott insulator with a photoinduced EDL based on a photochromic spiropyran 
monolayer. Because the spiropyran can isomerize reversibly between nonionic and 
zwitterionic isomers through photochemical processes, two distinct built-in electric 
fields can modulate the carrier density even at cryogenic conditions. 


T he electric potential difference that usually 
exists across a phase boundary creates two 
layers of space charges of different signs, 
the so-called electric double layer (EDL) {1, 2). 
In recent years, EDL transistors with ionic 
liquids or electrolytes as gate dielectrics have 
been used increasingly {3, 4). Under an applied 
gate voltage, ions are driven to the surface of the 
channel materials, forming an EDL that acts as a 
nanogap capacitor. Using this method to accu- 
mulate a large numbers of carriers (-10^^ cm"^) 
{5-9), field-induced superconductivity has been 
realized in various materials. However, modu- 
lations of the carrier density in EDL transistors 
have been limited to the high-temperature regime 
because the ionic motions of ionic liquids or elec- 
trolytes are frozen below -200 K 

Here we observe field-induced superconductivity 
even at low temperatures by using a photoactive 
EDL system, a spiropyran monolayer. Spiropyran 
is a photochromic molecule that can photoisomerize 


^Research Center of Integrative Molecular Systems (CIMoS), 
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Japan. ^RIKEN, Wako, Saitama 351-0198 Japan. 

^Corresponding author. E-mail: msuda@ims.ac.jp (M.S.): 
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reversibly between the nonionic spiropyran (SP) 
form and the zvyitterionic merocyanine (MC) 
form upon ultraviolet (UV) (forward reaction) or 
visible light irradiation (reverse reaction) (fig. SI) 
{10, 11). Therefore, an EDL-like large electric field 
across the film can be induced or be eliminated 
by UV light or visible light irradiation, respec- 
tively. The advantage of the photochromic EDL 
system over the conventional one is that it retains 
the modulation capability even at low temper- 
atures, because these photochromic reactions 
proceed with photon-energy dissipation under 
nonequilibrium conditions. Hence, a highly aligned 
SP monolayer is a promising candidate for a 
photoinduced EDL system that can directly svyitch 
superconductivity only by photoirradiation. Al- 
though there have been attempts to tune the 
conductivity and even superconductivity using 
photochromic layers {12-17), photoinduced phase 
transitions (including superconducting transi- 
tions) have not been realized yet. 

We fabricated a photoactive superconducting 
device by laminating a thin single crystal of 
K-(BEDT-TTF) 2 Cu[N(CN) 2 ]Br (K-Br) [BEDT-TTF: 
bis(ethylenedithio)tetrathiafulvalene] on top of 
an Al 203 (or Hf 02 )/Nb:SrTi 03 substrate covered 
with a self-assembled monolayer of spyropiran 
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derivatives (SP-SAM) (Fig. 1). The device also 
has a bottom-gate structure for electrostatic car- 
rier doping to finely modulate the carrier density 
accumulated as a result of photoirradiation. We 
chose K-Br as a target material because K-type 
BEDT-TTF is a strongly correlated molecular con- 
ductor {18) that has a relatively low half-filled 
carrier density (~2 x lO^Vcm^) {19). We recently 
demonstrated a field-induced superconductivity 
in a thin single crystal of K-Br {20). 

The surface density of SP-SAM estimated by 
the comparison of absorption spectra between a 
monolayer film and a solution (fig. S2) was -1.8 x 
10^^ molecules/cm^. The surface density for the 
close-packed model (2.0 x molecules/cm^) 
was already reported {13), indicating that we 
have obtained a high-coverage SP-SAM. The 
photoisomerization of SP-SAM at room temper- 
ature was monitored by UV-visible absorption 
spectroscopy (fig. S3). The spectrum of the ini- 
tial state has a weak absorption peak at 580 nm 
that is ascribed to the n-n* transition in the MC 
form. This means that the initial state of SP-SAM 
is in a SP state-dominant mixed state. After UV 
irradiation for 5 min, the absorption peak was en- 
hanced, indicating SP-to-MC photoisomerization. 

After a subsequent irradiation with visible light, 
the peak almost vanished, indicating MC-to-SP 
back photoisomerization. SP-to-MC-to-SP photo- 
isomerization was then repeated without any at- 
tenuation of the area between the absorption 
curves. These reversible spectral changes clearly 
demonstrated reversible photoisomerization of 
spiropyrans, even after the SAM formation. 

Subsequently, we investigated the effects of 
photoirradiation of UV and visible light on the 
four-probe resistance of K-Br devices. Figure 2A 
shows a time trace of the resistance for the device 
at 5 K After brief resistance measurements in the 
dark, the device was irradiated with UV light. Dur- 
ing the UV irradiation, the resistance value drasti- 
cally decreased and the final state persisted even 
after the irradiation was stopped. Furthermore, 
this reduced resistance value recovered to nearly _ 
the initial value by visible light irradiation, which I 

also lasted even after the irradiation was stopped ~ 

(Fig. 2B), showing reversible switching capabil- 
ity. The resistance switching was repeated several 
times, with negligible attenuation (Fig. 2C). To 
rule out that the response was caused by photo- 
thermal effects, we also conducted control experi- 
ments on the K-Br device without the photochromic 
layer. We found almost no resistance change 
during UV and visible light irradiation. There- 
fore, the reversible resistance switching must 
be caused by the reversible photoisomerization 
of SP-SAM. 

Figure 3A shows the temperature dependence 
of the resistance after various UV irradiation 
times at 5 K. In the initial state, the K-Br device 
exhibited Mott insulating behavior at low tem- 
perature; this behavior is contrary to the super- 
conducting character of the bulk crystal and 
caused by the tensile strain from the substrate 
{20-22) (details are described in fig. S4). The re- 
sistance value decreased with increasing the ir- 
radiation time, and when the irradiation time 


reached 180 s, the device showed an abrupt re- 
sistance drop around 7.3 K; the drop was sup- 
pressed by a magnetic field of 7 T (fig. S5). This 
behavior indicates that the system showed a 
photoinduced phase transition from a Mott in- 
sulating phase to a partially superconducting phase. 

The photoinduced superconducting transition 
was also confirmed by magnetic susceptibility 
measurements (Fig. 3B). In the initial state, small 


shielding and Meissner effects were observed 
below ~10 K in zero-field-cooled and field-cooled 
measurements. After UV light irradiation, the 
zero-field-cooled curve was strongly enhanced, 
implying an increase in the superconducting 
fraction. The superconducting fraction created 
by UV light irradiation can be estimated by com- 
paring the diamagnetic value of the K-Br device 
with that of bulk K-Br. From this estimation, ~5% 




k B 

1 + v+ V r 

X X 



non-doped (SP state) hole-doped (MC state) 

. Zi \JY •# tett m « •<!**« 

V.S 



Mott Insulator 


Superconductor 


Fig. 1. Configuration of the K-Br device. (A) Schematic cross section of the device: the spiropyran (SP) 
monolayer can isomerize reversibly by UV and visible light irradiation at the interface (fig. SI). (B) Optical 
microscope image of the K-Br single crystal laminated on the substrate. Gold wires were attached to the 
crystal with carbon paste for standard four-probe measurement. Scale bar, 200 gm. 




Time (sec.) 


Fig. 2. Time trace of the resistance for the 
K-Br devices with and without SP-SAM 
under photoirradiation at 5 K. (A) After the 
resistance measurements in the dark for 180 s, 
each device was irradiated with UV light for 
180 s.The measurements were continued for 
180 s after the irradiation was stopped. (B) After 
the resistance measurements in the dark for 
180 s, each device was irradiated with visi- 
ble light for 180 s. The measurements were 
continued for 180 s after the irradiation was 


stopped. (C) Time trace of the resistance for the K-Br device with alternate photoirradiation of UV 
and visible light. 
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of the volume fraction seems to be caused by 
UV light irradiation, suggesting that the super- 
conducting transition occurs not only at the 
interface layer but also in the bulk of the device, 


because the volume fraction of the interface layer 
(monolayer) is only -0.3% (the thickness of the 
K-Br crystal used for the magnetic measurements 
was 600 nm, which corresponds to -300 layers). 



Fig. 3. Temperature dependence of the resistance and the magnetization for the K-Br device. 

(A) Resistance versus temperature for the K-Br device after various times of irradiation with UV light. 

(B) Magnetic susceptibility [zero-field-cooled (ZFC) and field-cooled (FC) measurements] for device 
4 in the initial state and UV-irradiated state under a field of 100 Oe. 


Because the carriers injected at the interface are 
unlikely to diffuse into the layers far from the in- 
terface, such a bulk phase transition should orig- 
inate from an interlayer dielectric screening of 
the Coulomb interaction, where effective repul- 
sion among Mott insulating carriers in the bulk 
of the device is screened by the mobile supercon- 
ducting carriers at the interface (23). A similar 
bulk phase transition has been also observed for 
the field-induced superconductivity in a single- 
crystal field-effect transistor (FET) of K-Br (20). 

We now turn to the study of the photoswitch- 
ing mechanism. K-Br has been shown to turn 
superconducting as a result of electrostatic car- 
rier doping (20). It is therefore reasonable to 
assume that the photoinduced phenomena pres- 
ented here are also caused by carrier doping by 
the photoisomerization of SP-SAM. It has been 
found that, when organic molecules with sizable 
dipole moments are organized as a SAM, a co- 
operative effect of charge transfer occurs be- 
tween the monolayer and the substrate (24, 25). 
The electric field produced by such a monolayer 
dipole can exceed the field that can be applied 
across the gate dielectric in a standard FET 
configuration. We can calculate the expected 
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Fig. 4. Evolution of the contour plot of resistance as a function of 
temperature and gate voltage during the photoirradiation and ex- 
tended phase diagram of the K-Br device. The contour plots were 
recorded in initial state (A); UV-irradiated states with irradiation time of 
30 s (B), 60 s (C), 90 s (D), 120 s (E), 150 s (F), and 180 s (G): and visible- 
irradiated state with the irradiation time of 300 s (H). (I) Phase diagram of 
the K-Br as a function of temperature and carrier concentration was obtained 
by overwriting the contour maps (B) to (G) on (A) with appropriate shifts in 
the gate voltage (irradiation time), because each contour map could be con- 
nected smoothly at nearly identical resistance point. SC and Ml denote the 
superconducting and the Mott insulating phases, respectively. 
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electric field inside a SAM through the follow- 
ing relationship 

-^in = -^(hmol / ^^mol ) ( 1 ) 

where N, cZmoi e are, respectively, the surface 
density, the height of the molecules, and the ef- 
fective dielectric constant inside the monolayer 
(26, 27). For our SP-SAM, we assumed that N is 
1.8 X 10^^ molecules/cm^, e is between 2 and 3, Pmoi 
is 6.4 D for the SP state and 13.9 D for the MC state, 
and cZmoi is 2.2 nm (13); conversion efficiency from 
SP-to-MC state is -66%. Using these values, we can 
estimate the difference in the electric fields (Ejn) 
between the SP and MC states as 5.0 - 7.5 MV/cm, 
which is much larger than the electric fields that 
can be produced by applying a gate voltage using 
the bottom-gate structure. 

To confirm this hypothesis, we performed gate 
voltage sweeps using the bottom-gate structure 
with various irradiation times. Figure 4 shows the 
contour plots of the resistance as a function of 
temperature and gate voltage for increasing times 
of UV light irradiation, which was performed at 
5 K. The irradiation was stopped during the gate- 
sweep mode electrical measurement. The K-Br 
device without SP-SAM was not affected by UV or 
visible light irradiation (fig. S9). For K-Br devices 
with SP-SAM, the contour plot shifted in the 
positive voltage direction with elapsed UV-light 
irradiation time, reflecting a gradual progress of 
the hole carrier injection, as expected from Eq. 1. 
After the irradiation time of 60 s, a sudden re- 
sistance drop appeared at negative gate voltage 
(Ug ~ -4 V), showing the (field-effect assisted) 
photoinduced superconducting transition. When 
the irradiation time reached 180 s, a superconduct- 
ing phase could be observed even without gate 
voltage. The observed voltage shift after 180 s is 
-9 V, which leads to the difference in the built-in 
potential of 4.3 MV/cm. This value is somewhat 
smaller than the results obtained theoretically by 
Eq. 1 (5.4 - 8.1 MV/cm). One reason for this dis- 
crepancy is a decline of the conversion efficiency 
of SP-to-MC photoisomerization because of the 
lamination of the crystal; it is well known that 
the free volume around spiropyrans has an effect 
on the photochromic response (28). 

Finally, a phase diagram of the K-Br as a 
function of temperature and irradiation time 
(with applied gate voltage) was obtained by 
superimposing Fig. 4, A to G, as shown in Fig. 41. 
We found continuous shift of the contour plots in 
Fig. 4, A to G, without noticeable color gradation 
difference in the overlapping areas. In addi- 
tion, we observed a portion of a superconducting 
dome resembling that of the superconducting 
cuprates (29) and the K-Br with conventional FET 
configuration (20). These results indicate that the 
photoinduced effect was caused not by thermal 
heating or interface chemical reactions but rather 
by the (hole) carrier doping. Furthermore, the 
field-induced carriers and the photoinduced car- 
riers were working as the same type of carriers 
that are indistinguishable from one another 
in the K-Br crystals. This means that SP-SAM 
EDL can expand the limit of electrostatic car- 
rier doping beyond the density that can be 


accumulated only by means of a normal FET 
configuration. 
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DENTAL MATERIALS 


Amorphous intergranular phases 
control the properties of rodent 
tooth enamel 

Lyle M. Gordon/* Michael J. Cohen/ Keith W. MacRenaris/ Jill D. Pasteris/ 
Takele Seda/ Derk Joester^f 


Dental enamel, a hierarchical material composed primarily of hydroxylapatite nanowires, 
is susceptible to degradation by plaque biofilm-derived acids. The solubility of enamel 
strongly depends on the presence of Mg^^, F“, and COa^". However, determining the 
distribution of these minor ions is challenging. We show— using atom probe tomography, 
x-ray absorption spectroscopy, and correlative techniques— that in unpigmented rodent 
enamel, Mg^^ is predominantly present at grain boundaries as an intergranular phase 
of Mg-substituted amorphous calcium phosphate (Mg-ACP). In the pigmented enamel, a 
mixture of ferrihydrite and amorphous iron-calcium phosphate replaces the more soluble 
Mg-ACP, rendering it both harder and more resistant to acid attack. These results 
demonstrate the presence of enduring amorphous phases with a dramatic influence on 
the physical and chemical properties of the mature mineralized tissue. 


T ooth enamel has evolved to resist the most 
grueling conditions of mechanical stress, 
fatigue, and wear (1). In addition, it is ex- 
posed to chemical attack in the corrosive 
environment of the oral cavity. Caries, or 
tooth decay, commonly begins with the de- 
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mineralization of enamel because of produc- 
tion of acid by oral bacteria (2, 3). Caries has 
an extremely high relative incidence of disease, 
with nearly 100% of adults worldwide having or 
having had caries (4). Although drinking water 
fluoridation is credited with reducing caries by 
an impressive 25% across all age groups, current 
dental expenditure in 2012 has been hovering 
at ~$110 billion in the United States alone (5). 
As the most prevalent chronic disease, caries 
thus remains an important public health prob- 
lem. A substantial challenge encountered in char- 
acterizing structure and chemistry of enamel is 
its complex, hierarchical, and graded architec- 
ture with substantial heterogeneity in structure 
and composition. Closing this gap in our knowl- 
edge will aid development of quantitative models 
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Fig. 1. Overview of mouse incisor enamel structure. (A and B) SEM images of a lactic acid-etched cross-section. (C) In this bright-field TEM image of a 
FIB-prepared thin section and (inset) selected-area diffraction pattern (SAED) of the edge of one enamel rod, alignment of nanowires parallel to the 
crystallographic c axis of the apatite lattice is apparent. Scale bars, 5 |j,m (A); 250 nm (B); 200 nm (C): and 2 nm“^ (inset). 



Fig. 2. SEM images of lactic acid-etched rodent enamel and sintered OHAp cross-sections. (A) 

Mouse (Mus musculus) incisor inner enamel. A comparison of sections before and after etching is provided 
in fig. S12. (B) Fluoride-treated mouse inner enamel. (C) Rat (Rattus norvegicus) pigmented enamel. (D) 
Sintered synthetic OHAp. Scale bars, 250 nm. 


of enamel mechanical properties, connect the 
tissue-level etiology of caries to the underlying 
material structure and properties, and help in- 
novate prophylaxis, early detection, and mini- 
mally invasive intervention. 

Enamel is composed of single-crystal hydrox- 
ylapatite [OHAp, Caio(P 04 ) 6 (OH) 2 ] nanowires 
that have a rhombohedral shape with typical 
cross-sectional edge lengths of 25 to 50 nm 
(Fig. 1) (6, 7). Nanowires are elongated parallel 
to the crystallographic [00.1] direction and have 
very large aspect ratios. On the order of 10^ 
nanowires are bundled in rods. Rods are ar- 
ranged in a dense three-dimensional (3D) weave 
interspersed with interrod enamel; this arrange- 
ment serves to deflect and arrest cracks (S). Spe- 


cific features depend, among others, on location 
within the enamel layer of a given tooth. In the 
inner enamel of rodent incisors, rods are ar- 
ranged in decussating layers (Fig. 1). In outer 
enamel, rods are packed in parallel instead 
and reach the enamel surface at a character- 
istic angle. Superimposed on the hierarchical 
structure, however, is a level of chemical het- 
erogeneity that has been much more difficult 
to capture. 

Enamel is highly mineralized at 96 weight 
percent (wt %) mineral, ~3 wt % water, and only 
wt % residual biomacromolecules (6). How- 
ever, even small amounts of substituting ions 
such as magnesium (0.2 to 0.5 wt % Mg^^), car- 
bonate (2 to 4 wt % COs^”), and fluoride (F“) 


strongly affect its properties (9). This is com- 
monly ascribed to substitutional defects in the 
OHAp lattice. For example, Mg^^ substitution for 
Ca^^ or substitution for either PO/“ or OH“ 

results in increased solubility {10). Given the low 
solubility of fiuorapatite, F“ substituting for OH“ 
is thought to decrease enamel susceptibility to 
acid attack and enhance remineralization. The 
concentration of these ions typically is graded 
over tens to hundreds of micrometers, with 
Mg^^ and COa^" concentration increasing from 
the enamel surface to the dentino-enamel junc- 
tion (3). Little is known about the distribution 
of ions at the length scale of rods or individual 
nanowires and the interfaces between them. 

We used a comparative approach to under- 
stand the interplay between susceptibility to 
acid attack, mechanical properties, and nano- 
scale structure and chemistry. Rodent enamel 
is a generally accepted model system for human 
enamel (ii). We therefore compared unpigmented 
“regular” rodent enamel with pigmented rodent 
enamel. Pigmented enamel is a reddish-brown, 
iron-rich type of outer enamel frequently found 
as an outermost layer on the incisors of rodents, 
shrews, and a few other species {12). TVpically 
only 10 to 15 pm thick, its structural organiza- 
tion, but not its chemistry, is similar to that of 
the regular outer enamel directly underneath. 

The susceptibility of enamel to dissolution in 
acidic environments was evaluated qualitatively 
by means of acid etching {13). Under conditions 
designed to approximate the intra-oral environ- 
ment during carious attack, we observed that 
etching is highly anisotropic even at the level of 
individual rods (Fig. 2). In regular mouse inner 
enamel, etching results in a “forest” of aligned 
nanowires, indicating that dissolution along 
the boundaries between {hk.0} facets is much 
more rapid than at the (00.1) face. When reg- 
ular enamel is topically treated with fluoride 
before etching, dissolution along the grain bound- 
aries is still the prevalent mode, but the extent is 
reduced. Although evidence in the literature led 
us to expect increased resistance of pigmented 
enamel to acid attack (14), we were surprised by 
the dramatic effect that indicates a change in 
mechanism. Specifically, etching parallel to the 
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grain boundaries is strongly suppressed, and 
slight dissolution of the nanowires on the (00.1) 
face leads to a characteristic honeycomb appear- 
ance in etched pigmented enamel. In contrast, 
etching of randomly oriented, polycrystalline 
OHAp with very low impurity levels is only mild- 
ly anisotropic. At comparable times of exposure 
to acid, only minor corrosion at the OHAp grain 
boundaries was observed, and there was little to 
no evidence of bulk dissolution or a strong de- 
pendence of etch rate on grain orientation. 

We confirmed these observations through 
quantitative assessment of the rate at which 
enamel is etched. Because of their larger size, 
incisors from the European rabbit {Oryctolagus 
cuniculus) were used as a model for regular 
enamel, and from the North American beaver 
{Castor canadensis) as a model for pigmented 
rodent enamel. Although these organisms be- 
long to different orders— the beaver is a rodent, 
and the rabbit is a lagomorph— recent evidence 
puts them in a monophyletic clade (Glires), and 
the structure of their outer enamel is similar (i5). 
An additional advantage of beaver incisor is 
that the outer enamel is much thicker (>100 pm) 
than, for example, in the rat, with no noticeable 
structural difference in the arrangement of ap- 


atite nanowires between the outermost 10 to 
15 pm of pigmented enamel and the underlying 
regular outer enamel (fig. SI). 

A known area, typically 10 to 20 mm^ in size, 
of the enamel surface was exposed to a lactic 
acid solution (250 mM, initial pH = 4.0, 37°C). 
The concentration of relevant ions in the etchant 
was monitored over time by using inductively 
coupled plasma mass spectrometry (ICP-MS) 
(fig. S14). The average loss of enamel mass per 
unit time and area was determined from time- 
concentration data, assuming a linear depen- 
dency, which is reasonable at short etching 
times. Consistent with electron microscopy and 
earlier qualitative observations {14), pigmented 
beaver enamel etches at a much lower rate 
(115 + 15 ng mm"^ min"\ n = 3 incisor segments) 
than that of regular rabbit enamel (655 + 60 ng 
mm"^ min"\ n = S incisor segments). At least 
some of this difference could in principle arise 
from slight structural or chemical differences 
in the outer enamel between the two species. 
However, when the regular outer enamel of the 
beaver was exposed with the careful mechan- 
ical removal of the pigmented layer, the etch- 
ing rate was comparable with that of rabbit 
enamel (695 ± 60 ng mm"^ min"\ n = 3 xxxxx). 


Topical treatment of the enamel so exposed with 
fluoride at neutral pH reduced etching rates sub- 
stantially (205 + 35 ng mm"^ min"\ n = 3 incisor 
segments). However, the etch rate of fluoridated 
enamel is still about twice faster than that of 
pigmented enamel, indicating that pigmenta- 
tion affords better protection against acid attack. 
However, remineralization in the presence of 
fluoride, at different pH, and over longer time 
periods may give different results. Taken together, 
the electron microscopy of etched enamel sur- 
faces and quantitative measurement of etch 
rates is strong evidence that the composition of 
enamel is heterogeneous across the diameter 
of individual nanowires and that this hetero- 
geneity affects the anisotropic etching of grain 
boundaries and the overall susceptibility of 
enamel to corrosion. 

Beyond the impact on acid etching, enamel 
composition also affects the mechanical proper- 
ties. We systematically determined enamel hard- 
ness as a function of the distance from the 
enamel surface using nanoindentation (fig. S2). 
Pigmented outer enamel of beavers and rats is 
harder (5 to 6 GPa, n = 3 incisors, 170 indents) 
than the underlying nonpigmented outer (4 to 
5 GPa, n = 3 incisors, 386 indents) and inner 
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Fig. 3. APT reconstructions and compositional profiles. (A) Mg (^^Mg^^) 
ion positions in mouse outer enamel. (B) F (^°Ca^^F^) ion positions in fluoride- 
treated mouse inner enamel. (C) Fe (^®Fe^^) ions in pigmented rat enamel. 
Scale bars, 10 nm. The view direction is parallel to the long axis of the nano- 
wires. (D to F) Representative concentration profiles across grain boundaries. 
(G) Isosurface (0.5 atomic %) surrounding Mg-rich multiple grain boundary in 


(A). (H) Isosurfaces (5 atomic %) enveloping Fe-rich multiple grain boundary in 
(C). Scale bars in (G) and (H), 5 nm. (I to K) Representative proxigrams of 
multiple grain boundaries in (I) mouse outer enamel, (J) fluoride-treated outer 
mouse enamel, and (K) pigmented rat enamel. The distance d is relative to the 
isosurface. In concentration profiles and proxigrams, the solid line represents 
the mean and the shaded area the mean ± SD, based on counting statistics. 
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enamel (4.5 GPa, n = 3 incisors, 18 indents). The 
hardness does not change abruptly but rather 
decreases gradually from the pigmented enamel 
surface toward the nonpigmented outer enamel. 
The concentration of iron is graded in the same 
direction (fig. S2). In rat enamel, a structural 
transition between the outer and inner enamel 
may contribute to the gradient in mechanical 
properties {12, 14). In beaver enamel, no such 
transition is apparent. Furthermore, in both rat 
and beaver, hardness is nearly constant across 
the unpigmented outer enamel of the extreme 
mesial and distal aspect of the incisor (fig. S2). 
Clearly, the presence of iron has a substantial 
impact on enamel hardness. A likely function 
of the hardness gradient is to maintain a sharp 
edge on the continuously growing incisor {15). 

Thus, iron not only protects enamel against 
acid attack but also increases mechanical hard- 
ness. Etching indicates that there are compo- 
sitional differences across grain boundaries 
between nanowires. Atom probe tomography 
(APT) is a technique suited to compositional 
mapping at such length scales {16, 17). The re- 
cent introduction of ultraviolet laser-pulsed APT 
has enabled the investigation of the chemical 
nanostructure of mineralized tissues such as the 
chiton tooth, bone, and dentin {16, 17). During 
APT, single atoms or small clusters on the surface 
of a very sharp specimen are field-evaporated 
after a laser pulse. The resulting ions are pro- 
jected onto an imaging detector and identified 
by their time of flight. The identity and position 
of each ion are used to create 3D reconstructions 
of the sample, with a spatial resolution typical- 
ly better than 0.2 nm. Because of its unbiased 
sensitivity across the periodic table, APT is par- 
ticularly powerful for samples with low-atomic- 
number components. 

We prepared samples of inner and outer en- 
amel from ground and polished tooth sections 


by means of focused ion beam (FIB) milling {18). 
APT spectra show the typical features of OHAp 
and a range of minor constituents, including Mg 
and Na in regular enamel, and Fe in pigmented 
enamel (fig. S3) {17). In 3D reconstructions, the 
cross sections of faceted nanowires are apparent 
from the distribution of minor ions (Fig. 3, A to 
C). At grain boundaries in regular enamel, Mg^"^ 
is enriched ~20 times over the apparent solubil- 
ity limit of -0.03 atomic % in the bulk of the 
nanowire (table SI). This peripheral enrichment 
confirms earlier in vitro experiments, showing 
that Mg^^ is not readily incorporated into the 
apatite crystal lattice {10, 19). In pigmented enam- 
el, Fe is almost completely excluded from the 
interior of the nanowires (< 0.2 atomic %). No 
Fe was detected in regular enamel, and only 
trace amounts of Mg were present in pigmented 
enamel (table SI). Concentration profiles across 
the grain boundaries (Fig. 3, D to F) reveal that 
the concentration of minor ions is sharply ele- 
vated over a narrow region. The absolute con- 
centration and total amount of Mg or Fe atoms 
at grain boundaries varies considerably, even 
between the boundaries on one nanowire and 
its neighbors. This variability may result from 
differences in surface energy of the particular 
facets but is likely also dependent on the ki- 
netics of crystallite growth. 

A convenient way to visualize the 3D distribu- 
tion of impurities at multiple grain boundaries is 
by way of an isoconcentration surface (isosur- 
face) (Fig. 3, G and H) that encloses a volume in 
which the concentration of a given ion is higher 
than a threshold. Proximity histograms (proxi- 
grams) (Fig. 3, 1 to K) report the average mole 
fraction of ions as a function of distance to this 
surface {20). The physical dimensions and the 
exceptionally high concentration of trace ions 
in the associated proxigrams — 60 times the 
apparent solubility limit for Mg and at least 80 


times the solubility limit for Fe— are strong 
indications that an intergranular phase that is 
chemically and structurally distinct from apatite 
is present. In regular enamel treated topically 
with fluoride, there is colocalization of F and 
Mg (Fig. 3, E and J, and fig. S13), indicating 
that grain boundaries and grain edges serve as 
short-circuit diffusion pathways. Thus, an un- 
derstanding of the fundamental processes of 
tooth decay must account for transport along 
grain boundaries and the participation of the 
Mg-rich intergranular phase {21). Intergranular 
phases are known to strongly affect mechanical 
properties of ceramic materials {22)', the increased 
hardness of pigmented enamel is likely a result 
of the structural and compositional difference 
of the interphase. 

Although the existence of Mg-rich phases in 
enamel— in particular, dolomite [CaMg(C03)2], huntite 
PdgA(CD3)4], and vdiMockite Ca9Mg(P04)6(P030H)— 
has been proposed previously, these phases have 
not been observed experimentally {19). The strong 
enrichment of Mg^^ at interfaces and in the in- 
tergranular phase (-90% of total Mg) observed 
by APT allows us to use x-ray absorption spec- 
troscopy to probe its structure. The position and 
intensity of transitions in Mg K-edge x-ray ab- 
sorption near-edge structure (XANES) spectra 
are sensitive to the coordination number, geom- 
etry, bond length, and order at intermediate range. 
Many Mg-containing minerals can be identi- 
fied by their spectral fingerprints {23). Spectra of 
enamel lack the pre-A, D, and E peaks that are 
characteristic for crystalline dolomite, huntite, 
and whitlockite (Fig. 4, A and B). However, the 
Mg-rich intergranular phase shows a striking 
similarity to spectra of synthetic Mg-substituted 
amorphous calcium phosphate (Mg-ACP), Mg/ 
phosphate rich amorphous calcium carbonate 
(ACC) in the lobster cuticle, and Mg-silicate glasses 
{23, 24). The dominant feature in spectra of these 



Fig. 4. X-ray absorption spectra. (A) Mg K-edge XANES and (B) /^^-weighted EXAFS spectra. (C) Fe K-edge XANES and (D) /^^-weighted EXAFS spectra. A post- 
edge shoulder at -7138 eV (black arrow) appears in pigmented enamel and amorphous FeP 04 - 2 H 20 . In EXAFS spectra, fits (dashed lined) are overlaid on 
experimental data (solid lines). 
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amorphous materials is the transition associated 
with the first coordination sphere (feature B), with 
little or no features that depend on order beyond 
the first shell. At the same time, the lower-edge 
energy is indicative of a lower coordination num- 
ber and Mg-0 bond distance consistent with amor- 
phous materials. 

Confirming this assessment, the Mg K-edge 
extended x-ray absorption fine structure (EXAFS) 
of enamel is nearly indistinguishable from that 
of synthetic Mg-ACP (Fig. 4, A and B, and fig. S4). 
In sharp contrast to the crystalline reference com- 
pounds, in which numerous scattering features 
from more distant shells are apparent, spectra 
of enamel and Mg-ACP are dominated by the 
nearest neighbor shell. Analysis of the local en- 
vironment around Mg by fitting FXAFS spectra 
with theoretical scattering paths (table S2) re- 
veals that the nearest-neighbor Mg-0 bond lengths 
in enamel (2.03 A) and Mg-ACP (2.02 A) are no- 
tably shorter than Ca-0 bonds in OHAp (2.40 A) 
and ACP (2.36 A) and also shorter than the Mg-0 
bonds in the crystalline reference compounds 
(2.08 to 2.11 A) {25, 26). However, Mg-0 bonds in 
enamel are similar in length to Mg-0 bonds ob- 
served in Mg-substituted ACC {24). In enamel 
and Mg-ACP, this shortening is accompanied by 
a reduction in the coordination number from 6 
to ~4, which is indicative of an amorphous ma- 
terial and/or the presence of water in the first 
coordination sphere {23, 24). We conclude that 
the environment of the majority of Mg^^ in enamel 
exhibits only short- to medium-range order, with 
dramatic Mg-0 bond shortening, a reduction in 
coordination number, and the possibility of water 
or hydroxyl ions in the first shell, similar to the 
environment of Ca in ACP {25). Although we favor 
the interpretation that Mg-ACP is present as an 
interphase, an alternate hypothesis is that Mg^^ 
occupies disordered Ca[II] sites in apatite {27). In 
either case, the disordered environment around 
Mg is likely to greatly increase the solubility at 
the periphery of OHAp nanowires and result in 
anisotropic etching. 

Qualitative comparison of Fe A-edge XANFS 
and FXAFS spectra of pigmented beaver enamel 
and reference standards suggests that iron is 
present in a ferrihydrite-like material (Fig. 4, C 
and D, fig. S5, and table S2). Quantitative anal- 
ysis of FXAFS along with Mossbauer (fig. S6) 
and Raman (fig. S7) spectroscopy confirms the 
presence of poorly crystalline ferrihydrite (Fh) 
{28). Because the presence of phosphate is known 
to inhibit growth of crystalline iron oxyhydrox- 
ides, we would expect that metastable Fh forms 
from any Fe(II) secreted by ameloblasts, through 
oxidation to Fe(III) and subsequent hydrolysis 
{29). The presence of Fh in pigmented beaver 
enamel is in agreement with results from nutria 
{Myocastor coypus) {30) but differs from the 
“nearly amorphous magnetite” postulated for pig- 
mented enamel of the shrew Blarina brevimiida, 
in which pigmented enamel may have arisen 
through convergent evolution {12). 

However, with both Ca^"" (7 to 19 atomic % Ca) 
and PO/" (8 to 13 atomic % P) present in APT 
reconstructions of the Fe-rich intergranular 


phase, the material cannot be simply pure Fh. 
Furthermore, the stability of the intergranular 
phase against transformation upon heating is 
much greater than that of bulk Fh and leads to 
different products (fig. S8) {31). Analysis of the 
pre-edge region (fig. S9) reveals that the lig- 
and field-splitting energy of pigmented enam- 
el {lODq = 1.13 eV) is larger than that of pure 
ferrihydrite (0.96 eV) and a small post-edge 
shoulder (Fig. 4C). Both are consistent with the 
presence of hydrated amorphous iron phos- 
phate (a-FeP 04 ' 2 H 20 , lODq = 1.19 eV). According 
to linear combination analysis of FXAFS spec- 
tra, as much as 42% of the iron may be present 
in this form (fig. SIO). Unlike the correspond- 
ing crystalline phases, amorphous phases can 
be very accommodating of substituents. The 
iron-rich interphase may thus be best described 
as a mixture of Fh and amorphous iron-calcium 
phosphate. Indeed, we observed that the inter- 
phase is strongly graded, ranging in composition 
from calcium-rich (Fe/Ca = 0.2) at the periphery 
to iron-rich (Fe/Ca = 4) in the center. Given the 
corrosion resistance of iron phosphates, replac- 
ing Mg-ACP with iron phosphate likely hardens 
pigmented enamel against acid attack {32). 

We find that Mg^^ segregates at grain boundaries 
in regular rodent enamel, as does Fe^^ in pigmented 
enamel. We present evidence for the presence 
of Mg-ACP as an intergranular phase in regular 
enamel and of a mixture of Fh and amorphous iron- 
calcium phosphate in pigmented enamel. Last, 
we link interphase structure and composition to 
the resistance of pigmented enamel against acid 
attack and its improved hardness. In addition to 
the recent recognition of amorphous phases in 
biomineralization as transient precursors {33, 34), 
these results demonstrate enduring amorphous 
phases with a dramatic influence on the physical 
and chemical properties of the mineralized tissue. 
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ATMOSPHERIC CHEMISTRY 

Direct kinetic measurement of the 
reaction of the simplest Criegee 
intermediate with water vapor 

Wen Chao,^’^ Jim-Ting Hsieh/’^ Chun-Hung Chang, ^ Jim Jr-Min Lin^’^’^* * 

Carbonyl oxides, or Criegee intermediates, are important transient species formed in 
the reactions of unsaturated hydrocarbons with ozone. Although direct detection of 
Criegee intermediates has recently been realized, the main atmospheric sink of 
Criegee intermediates remains unclear. We report ultraviolet absorption spectroscopic 
measurements of the lifetime of the simplest Criegee intermediate, CH 2 OO, at various 
relative humidity levels up to 85% at 298 kelvin. An extremely fast decay rate of 
CH 2 OO was observed at high humidity. The observed quadratic dependence of the 
decay rate on water concentration implied a predominant reaction with water dimer. 

On the basis of the water dimer equilibrium constant, the effective rate coefficient of 
the CH 2 OO + (H20)2 reaction was determined to be 6.5 (+0.8) x 10“^^ cubic centimeters 
per second. This work would help modelers to better constrain the atmospheric 
concentrations of CH 2 OO. 


C riegee intermediates (CIs) have long been 
implicated in atmospheric reactions of 
ozone with unsaturated hydrocarbons, but 
only recently were simple CIs detected via 
their preparation through the reactions of 
a-iodoalkyl radicals with O 2 (1). The reaction 
rate coefficients of simple CIs with atmospheric 
trace gases, including SO 2 , NO 2 , and carboxylic 
acids, could then be measured directly using vac- 
uum ultraviolet photoionization mass spectrom- 
etry (VUV-PIMS) (1-3). These measurements 
indicated that the previously reported rate coef- 
ficients derived from indirect ozonolysis experi- 
ments had been underestimated by factors of 50 
to 10,000 (1-3). Because CIs are important in the 
oxidative capacity of the atmosphere, as well as 
radical generation and aerosol formation there 
(4-6), the higher rate coefficients implied a greater 
role of CIs in atmospheric chemistry than previ- 
ously assumed. However, the precise kinetics of Cl 
reactions with water vapor have not yet been 
determined, even though these water reactions 
may control the atmospheric fate of CIs by virtue 
of the comparatively high abundance of water 
vapor. 

Welz et al. (1) investigated the kinetics of the 
simplest Criegee intermediate, CH 2 OO, by using 
VUV-PIMS to monitor CH 2 OO concentration. 
Although the rate coefficients of CH 2 OO reac- 
tions with SO 2 and NO 2 were determined, the 
authors did not observe any detectable reaction 
of CH 2 OO with water (reaction 1 ) and set an upper 
bound for the rate coefficient of 4 x lO"^ cm^ s"^ 
(1). Using a similar technique, Taatjes et al. mea- 
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sured the rate coefficient of the reaction of anti- 
CH3CHOO with water to be 1.0 (±0.4) x 10 "^“^ cm^ s~\ 
but still could not observe the reactions of water 
with CH 2 OO and s^/tz-CHsCHOO (2). Stone et al. 
reported < 9 x 10 "^^ cm^ s"^ by measuring the 
laser-induced fluorescence of H 2 CO, a product 
in the CH 2 OO reaction system (7). Ouyang et al. 
reported ki = 2.5 (±1) x 10“^^ cm^ s“\ determined 
in a relative rate experiment monitoring NO3 
production (8). 

Evidence for substantial reactivity of CH 2 OO 
with water has been reported in C 2 H 4 ozonolysis 
experiments (9, 10), despite a large scatter in the 
reported rate coefficient ( 10 "^^ to 10 “^^ cm^ s"^) 
(11-13). Berndt et al. (14) investigated H 2 SO 4 
formation from SO 2 oxidation in C 2 H 4 ozonolysis 
at various relative humidities (H^ up to 50%. 
Through analysis of the H 2 SO 4 yields, the au- 
thors implied a preferred reaction of CH 2 OO 
with water dimer, as suggested by the theoret- 
ical works of Ryzhkov and Ariya (15). 

Bemdt et al. (14) further pointed out that their 
result, as well as a series of earlier studies (9-13) 
using C 2 H 4 ozonolysis for CH 2 OO formation, in- 
dicates that the CH 2 OO reaction with water 
vapor is predominant under near-atmospheric 
conditions. However, this conclusion was not sup- 
ported by other recent studies (1, 2, 7, 8) that ap- 
plied the CH 2 I 2 -O 2 photolysis technique (CH 2 I 2 + 
/zv ^ CH 2 I + I; CH 2 I + 02 ^ CH 2 OO I). This 
discrepancy is puzzling, especially when detailed 
spectroscopic investigations in the VUV (1), UV 
(16-18), infrared (19), and microwave (20) spec- 
tral regions have confirmed that CH 2 OO is the 
predominant isomer formed in the CH 2 I 2 -O 2 
photolysis process. In addition, the kinetics of 
ozonolysis experiments are inherently complex: 
Reliable data for individual Cl reactions can be 
difficult to obtain under such conditions, as other 
oxidants (ozone and/or ozonides) are likely in- 
volved in complicated side reactions that may 


obscure the rate determinations (1-5). In partic- 
ular, the rate coefficients of Cl reactions with SO 2 , 
NO 2 , and carboxylic acids are found to be much 
faster in the direct determinations (1-3) than in 
the ozonolysis analyses (6, 21, 22), suggesting the 
need for reinterpretation of earlier ozonolysis 
measurements (3, 4). 

To clarify the kinetics of Cl reactions with wa- 
ter, which is ubiquitous in the atmosphere, we 
performed direct rate measurements at multiple 
humidity levels. CH 2 OO was synthesized by 
CH 2 I 2 -O 2 photolysis and probed via its strong 
UV absorption (16). The decay of CH 2 OO was 
detected with two complementary methods: One 
recorded the transient difference absorption 
spectra (280 to 600 nm) with an image-intensified 
charge-coupled device (iCCD) spectrometer at var- 
ious delay times after the photolysis laser pulse 
(16, 23); the other recorded full time traces of 
absorption change at 335 to 345 nm using a 
bandpass filter and a balanced photodiode 
detector (24). 

Typical transient absorption spectra of the 
CH 2 I 2 -O 2 photolysis system are shown in Fig. 1. 
The strong UV absorption band of CH 2 OO 
can be seen at a delay time of 15 ps. CH 2 OO is 
quite reactive; it decays at longer delay times, 
and depletion of CH 2 I 2 and formation of 10 
can also be clearly observed. The transient ab- 
sorption spectra can be decomposed into absorb- 
ance bands attributable to CH 2 I 2 (depletion), 
CH 2 OO, and 10 using their reported spectra 
(16, 25) (Fig. 1C). The corresponding number 
densities can be found in fig. S4. Detailed ki- 
netic analysis of the CH 2 I 2 -O 2 photolysis sys- 
tem (without water) has been published (26). 
When water vapor was introduced (Fig. IB), 
the CH 2 OO absorption band was found to de- 
crease much faster, indicating CH 2 OO reac- 
tion with water vapor. 

Selective detection of CH 2 OO could be per- 
formed by choosing a wavelength window of 335 
to 345 nm (Fig. 1), where CH 2 OO absorbs much 
more strongly than CH 2 I 2 and 10 (their cross 
sections are 1.21 x 10"^^ cm^, 8.58 x 10"^® cm^, and 
2.67 X 10"^® cm^, respectively, averaged over 335 
to 345 nm) (16, 25). 

Other species, such as H 2 CO and ICH 2 OO, may 
also be formed in the reaction cell. The absorp- 
tion of H 2 CO is rather weak, but the UV spectrum 
of ICH 2 OO is uncertain (26). Fortunately, the 
CH 2 OO spectrum has characteristic vibronic 
structures on the long-wavelength side (355 to 
405 nm) (16). Ting et al. (26) made use of this 
feature to show that the absorption band of 
ICH 2 OO does not overlap with that of CH 2 OO 
and hence would not interfere with the concen- 
tration determination of CH 2 OO via its UV 
absorption. 

Figure 2 shows typical temporal profiles of 
CH 2 OO concentration, [CH 2 OO], which decays 
much faster at higher humidity. The resultant 
decay rates are consistent with those from the 
iCCD measurements (24). Figure 3A shows the 
decay rate coefficient (A^obs) of CH 2 OO as a func- 
tion of TTr. Remarkably, A^obs depends on the 
relative humidity quadratically. Figure 3B plots 
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^obs against the concentration of water dimer 
estimated with the reported equilibrium con- 
stant {27). The linear behavior of A^obs versus 
[(H20)2] indicates that the reaction of CH2OO 
with water dimer (reaction 2) predominates. 

Even under the dry condition, CH2OO has a 
finite lifetime, mainly due to the reactions with 
itself and with other radical species {26). We 
assume that the overall decay rate of CH2OO is 
the sum of the dry decay rate, A:o[CH200], and 
the rate due to reaction with water, A:^[CH200]: 

- = K [CHpO] + A^pHpO] 

= fc^[CHpO] (1) 


where is the effective first-order rate co- 
efficient of the CH2OO reaction with water 
vapor. The observed plotted as a function 
of i^R in Fig. 4 and in figs. S5 to S8, can be well 
fitted with Eq. 2, regardless of different total 
pressures, initial concentrations, and buffer 
gases, indicating a quadratic rate dependence 
on water: 


where A^h is a proportional constant. 

Because not only CH2OO but also other radical 
species (I, OH, etc.) are formed in the CH2I2-O2 
photolysis system, we need to test the effect of 
other radical species. We varied the radical 
concentrations by a factor of 4 and found that 
the concentrations of radical species (includ- 
ing CH2OO) greatly affect the CH2OO lifetime 
under the dry condition {26) but do not affect 
the rate due to the CH2OO reaction with wa- 
ter vapor (table SI and fig. S2), indicating the 
validity of Eq. 1. In addition, we have tested the 
effect of OH radicals by adding 5 torr of pro- 
pane, an efficient OH scavenger, to the reac- 
tion system. We found that addition of propane 
makes no difference for the rates of CH2OO 
reaction with water vapor (fig. S3), indicating 
that OH radicals do not interfere with our rate 
determination. 

The water dimer concentration [(H20)2] can 
be estimated with Eq. 3, which takes into account 
the reported equilibrium constant Xp = 0.0501 
bar"^ at 298 K {27) and the ideal gas law: 


[(H20)2] =N, 


K ( H, 


, -5-P 

RT Vl00% 


= 1.23 X 10 


15 / H„ 

1 I 


Vioo% 


cm (3) 


Our A^h value of 7968 s"^ corresponds to k 2 = 
6.5 X 10"^ cm^ s"\ The manufacturer-specified un- 
certainty of our relative humidity sensor is 3.5% 
(we have verified this by using a few saturated 
salt solutions), which translates to 7% error in 
A^h- The vapor pressure of water has a consider- 


able temperature dependence, about 7% per 
kelvin, which corresponds to 5% uncertainty in 
[H2O] for our temperature stability of 0.7 K. The 
uncertainty in the dimer equilibrium constant is 
3.2% with a negative temperature dependence of 
-2% per kelvin {27). Then the overall percentage 



-i 1 1 ' ■ I ■ ' 1 ' 1 ' 1 ' 1 ' 1 ' r 

280 300 320 340 360 380 400 420 440 460 480 


Wavelength / nm 


where is Avogadro’s number and PsaX. is 
the saturation water vapor pressure (23.8 torr 
at 298 K). Because k^ scales quadratically with 
respect to and linearly with respect to [(H20)2], 
as demonstrated in Figs. 3 and 4, we have A^w = 
A:2[(H20)2]. 


Fig. 1. Difference transient absorption spectra recorded in the CH 2 I 2 -O 2 photolysis. (A and B) Spectra 
are shown at selected delay times after the photolysis laser pulse under dry conditions (Hr < 3%) (A) and 
wet conditions (Hr = 72%) (B). (C) Experimental transient absorption spectra were decomposed into 
bands assigned to CH 2 I 2 (depletion), CH 2 OO, and 10 using their reported spectra { 16 , 25 ). (See fig. S4 for 
details). The vertical dashed lines indicate the detection window of the balanced photodiode detector (335 
to 345 nm). 
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uncertainty in [(H20)2] can be estimated as l( 7 %f 
+ (2 X 5% - 0.7 X 2 %f + = 12%. Thus, we 

report = 6.5 (±0.8) x 10“^^ cm^ s“^ at 298 K. 


Including the water monomer reaction in 
the decay kinetics of CH2OO does not improve 
the fit to the experimental data. Considering 



Delay Time / ps 

Fig. 2. Typical temporal profiles of the absorbance change near the peak of the CH 2 OO band. 

Absorption was monitored with a balanced photodiode detector and a bandpass filter (335 to 345 nm) 
at various relative humidity levels (Hr = 0 to 83%). The smooth lines are single-exponential fits to the 
experimental data, which were averaged for 60 to 120 laser shots (experiment 3) (24). Because 10 absorbs 
rather weakly in this wavelength window and the depletion of CH 2 I 2 is constant after the photolysis 
(causing negative baselines at long times), these temporal profiles mainly represent changes in [CH 2 OO]. 


Fig. 3. Dependence of CH 2 OO 
decay rate coefficient on water 
monomer and dimer concentra- 
tions. (A) The decay rate 
coefficient of CH 2 OO (kobs) 
measured under various relative 
humidity levels Hr (experiment 3). 
kobs is obtained from single- 
exponential fit to the time traces 
measured with the balanced 
photodiode detector. (B) The 
same rate data as in (A), but 
plotted against the estimated 
concentration of water dimer. The 
horizontal error bar includes the 
uncertainties in the relative 
humidity and temperature (see 
text). The vertical error bar is the 
error in the exponential fitting (see 
table S3) but is too small to be seen. 



our experimental error bars, the upper limit of 

is estimated to be 1.5 x lO"^^ cm^ s"^. 

Our data demonstrate that two water mole- 
cules are required for the reaction that consumes 
CH2OO. We cannot distinguish, merely by the 
experimental data, whether it is due to the re- 
action of CH200 -i-(H 20)2 or a reaction such as 
CH200(H20) -I- H2O. Therefore, the reported k2 
should be regarded as an effective rate coefficient. 
Note that the pre-reactive complexes, CH200(H20) 
and CH200(H20)2, would not affect the detec- 
tion of CH2OO because of their low equilibrium 
concentrations (75). 

Previous direct kinetic measurements of Cl 
reactions with water vapor were performed by 
using the VUV-PIMS techniques (7, 2 ). Although 
VUV-PIMS is a powerful tool that can selectively 
detect a conformer or isomer on the basis of 
their different ionization thresholds, the total 
pressure of the reactor is limited to only a few 
torr. As a result, the water pressure used was 
1 torr or less ([H2O] < 3.2 x 10^^ cm“^), and 
[(1126)2] would be too low (< 2 x 10^^ cm"^ at 
298 K) to affect the decay rates of CIs. 

Using the same CH2I2-O2 photolysis source. 
Stone et al. (7) and Ouyang et al. ( 8 ) investigated 
the reaction at higher PH2O— up to 5 and 21 torr, 
respectively— but with indirect detection (detect- 
ing the effect of humidity on the CH2OO reaction 
with a third reagent). Both groups did not observe 
any reaction of CH2OO with water dimer, how- 
ever. The discrepancy might be explained by the 
following: Stone etal detected the laser-induced 
fluorescence of H2CO, one likely product in the 
CH2OO reaction system (for example, CH2OO -1- 
X — > XO -I- H2CO, where X denotes a radical); 
Ouyang et al. measured the yield of NO3 in the 
reaction of CH2OO with NO2 (CH2OO -1- NO2 ^ 
NO3 -I- H2CO). If the products of the CH2OO 
reaction with water dimer also end up forming 
the same detected product [H2CO for (7), NO3 
for (8)], the effect of adding water might not be 
very obvious in their experimental observations. 
The fluorescence signal of H2CO is also quenched 
by water, further complicating the situation of 
Stone et al. As a result, the rate coefficient of the 
water dimer reaction with CH2OO might be 
underestimated in their analyses using compli- 
cated but incomplete reaction mechanisms. 

The above assumption is partly supported by 
the theoretical work of Ryzhkov and Ariya (75). 
They have proposed that the products of CH2OO -1- 
(H20)2 are likely hydroxylmethyl hydroperoxide 
(HMHP; HOCH2OOH) and H2O; this reaction is 
quite exothermic, and some of the internally ex- 
cited HMHP may decompose to HOOH and H2CO 
with H2O acting as a catalyst (75). 

Berndt et al. (74) investigated the competitive 
kinetics for CH2OO reactions with SO2 (reaction 
3) and with water vapor by measuring the H2SO4 
product yield at PrsO ^ 12 torr in a C2H4 
ozonolysis experiment. They found considera- 
ble reduction in the H2SO4 product yield at 
high humidity and suggested k2lks = 0.29 ± 0.01 
at 293 K; that is, k2 = 1.2 x 10“^^ cm^ s~\ assuming 
k3 = 4 ! X 10“^^ cm^ s"^ (7). This value of k2 is about 
twice our value. The discrepancy may arise from 
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Fig. 4. Effective first-order rate coefficient of CH 2 OO reaction with water vapor fCw plotted as a 
function of relative humidity Hr. There are total 188 measurements at 298 K with buffer gases of N 2 
(experiments 1 to 6), O 2 (experiment 7), SFe (experiment 8), N 2 (experiment 9, the reference experiment 
for 10), and N 2 + CsHs (experiment 10). A different humidity sensor was used for experiments 9 and 10, 
which caused some deviation of the data points, but they are within the error bars of the humidity 
sensors (24). The total pressures are also listed in the figure. The quadratic fit to all data points yields 
kH = 7968 s“\ 


Table 1. Estimation of the effective first-order rate coefficients keff for CH 2 OO reactions with 
water vapor and atmospheric key species. 


Co-reactant 

^reaction 

(cm^ s-^) 

Assumed 

concentration^ 

Number density^ 
(cm-3) 

l(eff 

(s-‘) 

(H20)2 

6.5 X 10“^2* 

Hr > 36% 

>1.6 X 10^^ 

>1000 

SO 2 

4 X 10"^^+ 

50 ppb 

1.2 X 10^2 

50 

NO 2 

7 X l0-i2t 

50 ppb 

1.2 X 10^2 

9 

Carboxylic acids 

1 X 10“^°* 

5 ppb 

1.2 X 10^1 

12 


*This work. i(l). t(3). §At 298 K and 760 torr. 


the complexity of the ozonolysis reaction relative 
to individual Cl reactions. For example, with 
similar H 2 SO 4 detection, the rate coefficients of 
simple Cl reactions with SO 2 in ozonolysis ex- 
periments were reported to be on the order of 
10"^^ cm^ s"^ (22), much slower than those (2.4 x 
10 "^^ to 6.7 X 10 “^^ cm^ s"^) obtained from the di- 
rect determinations (2). 

Ryzhkov and Ariya (75) have performed quan- 
tum chemistry calculations on the reaction of 
CH 2 OO with water dimer and estimated the 
effective rate coefficient at various levels 
and temperatures. At 298 K, their estimated 
effective rate coefficient [figure 5 of (75)] is of 
the same order of magnitude as our experi- 
mental values. 

Even in a relatively diy atmosphere (77 r ~ 35%), 
the decay of CH 2 OO due to water is much faster 
than other CH 2 OO reactions with SO 2 , NO 2 , and 
carboxylic acids (Table 1). Recent direct kinetic 
measurements of Cl reactions with carboxylic 


acids suggest that the reaction with carboxylic 
acids is a substantially more important loss 
process for CIs than is previously assumed (3). 
But Welz et at. (3) used a value of derived 
from (7) that is smaller than this new determi- 
nation by a factor of 20 , leading to underestima- 
tion of the effect of the water dimer reaction 
with CH 2 OO. 

Theoretical calculations (75, 28, 29) and in- 
vestigations of alkene ozonolysis (14, 22, 30) 
indicate that the reactivity of CIs toward water 
depends on their structures. For example, anti- 
CH3CHOO was found to be more reactive to 
water than s^/tz-CHsCHOO (2); the above data 
show that water dimer is much more reactive 
than water monomer for CH2OO. Furthermore, 
our preliminary investigation of CH3CHOO sug- 
gests that the syn form reacts with water much 
more slowly than the anti form; the latter also 
reacts quickly with water dimer (see fig. S13). 
The role of water and water dimer in the at- 


mospheric chemistry of CIs surely demands 
further investigation. The method demonstra- 
ted in this work opens a door to direct kinetic 
measurements of CIs under near-atmospheric 
conditions. 
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NEURAL CIRCUITS 

Labeling of active neural circuits 
in vivo with designed 
calcium integrators 

Benjamin F. Fosque,*t Yi Sun,* Hod Dana,* Chao-Tsung Yang, Tomoko Ohyama, 
Michael R. Tadross, Ronak Patel, Marta Zlatic, Douglas S. Kim, Misha B. Ahrens, 

Vivek Jayaraman, Loren L. Longer, Eric R. Schreiter^ 

The identification of active neurons and circuits in vivo is a fundamental challenge in 
understanding the neural basis of behavior. Genetically encoded calcium (Ca^^) indicators 
(GECIs) enable quantitative monitoring of cellular-resolution activity during behavior. 
However, such indicators require online monitoring within a limited field of view. 
Alternatively, post hoc staining of immediate early genes (lEGs) indicates highly active 
cells within the entire brain, albeit with poor temporal resolution. We designed a 
fluorescent sensor, CaMPARI, that combines the genetic targetability and quantitative link 
to neural activity of GECIs with the permanent, large-scale labeling of lEGs, allowing a 
temporally precise “activity snapshot” of a large tissue volume. CaMPARI undergoes 
efficient and irreversible green-to-red conversion only when elevated intracellular Ca^^ 
and experimenter-controlled illumination coincide. We demonstrate the utility of CaMPARI 
in freely moving larvae of zebrafish and flies, and in head-fixed mice and adult flies. 


B rain function relies on patterns of synap- 
tic input and action potential firing, which 
are accompanied by transient changes in 
free intracellular calcium (Ca^"^) concen- 
tration ([Ca^^]) (i, 2). Genetically encoded 
calcium indicators (GECIs) are useful for moni- 
toring the activity of populations of neurons and 
synapses in behaving organisms (3-6). However, 
the transient nature of GECI responses follow- 
ing [Ca^^] rises requires continuous monitoring 
during behavior using sophisticated imaging 
equipment with limited fields of view, and often 
physical restraint to prevent brain movement. 
Alternatively, expression of immediate early genes 
(lEGs) such as Arc and cFos can be measured 
following free behavior (7) within several hours, 
but is only weakly correlated with neural elec- 
trical activity (8, 9) and is constrained neither 
by genetic cell type nor by a precise temporal 
window. The creation of molecular tools to al- 
low genetic targeting of direct reporters of neu- 
ral activity that can be rendered permanent 
for post hoc analysis across entire brains, and 
that only mark neurons active during short, 
user-defined behavioral epochs, would be trans- 
formative (10, 11). 

The GCaMP calcium indicator reports changes 
in free [Ca^^] [rise and decay times <1 s (i2)] 
through mechanisms that modulate the envi- 
ronment of the 488-nm-absorbing, fluorescent 
4-(p-hydroxybenzylidene)-5-imidazolinone chro- 
mophore of circularly permuted green fluores- 
cent protein (cpGFP) (13). The photoconvertible 
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fluorescent protein (FP) EosFP is a bright green FP 
that irreversibly converts to a bright red fluorescent 
species {backbone cleavage produces 2-[(lE)-2- 
(5-imidazolyl)ethenyl]-4-(p-hydroxybenzylidene)-5- 
imidazolinone} upon illumination with violet light 
(14). Previously, circular permutation of a photo- 
convertible FP and attachment to the Ca^^-binding 
protein calmodulin (CaM) and its associated M13 
peptide (M13) gave rise to a photoconvertible 
GECI that produced GCaMP-like rises in fluo- 
rescent intensity both in the unconverted, green 
state and in the converted, red state (IS). We rea- 
soned that related constructs would undergo 
Ca^'^-dependent allosteric modulation of the chro- 
mophore photoconvertibility. Such a protein spe- 
cies would convert from green to red only in the 
simultaneous presence of high [Ca^^] and user- 
supplied violet light (Fig. lA). By combining 
library screening and structure-guided muta- 
genesis (16) we produced such a protein, which 
we named CaMPARI (calcium-modulated pho- 
toactivatable ratiometric integrator). 

CaMPARI has the primary structure depicted 
in Fig. IB and fig. SI, and includes a nuclear ex- 
port signal (NFS) to exclude it from the nucleus 
when expressed in eukaryotic cells. CaMPARI 
exhibits 60 and 40% of the green and red fluo- 
rescence brightness of mEos2, respectively, and 
photoconverts to the red form 21 times as fast in 
the presence of calcium (Fig. 1C and table SI). 
Photoconversion (PC) of the calcium-bound and 
calcium-free states of CaMPARI occurs three 
times and one-seventh as fast, respectively, as 
that of the parent EosFP variant (fig. S2). No 
effect of calcium on PC of the parent EosFP var- 
iant was observed (fig. S2). 

Measuring the PC rate while titrating cal- 
cium gave an apparent dissociation constant 
(Xd) of 128 ± 6 nM (Fig. ID). We modified the 


affinity of CaMPARI for calcium via mutagen- 
esis at the CaM/M13 interface (fig. S3), thus 
increasing dynamic range in cells with a wide 
range of baseline and peak calcium concentra- 
tions. In addition to a calcium-dependent green- 
to-red PC rate, the fluorescence of both green and 
red CaMPARI states is also calcium-dependent, 
decreasing by a factor of 9 and 23, respectively, 
upon calcium binding (Fig. IE, fig. S4, and table SI). 

After initial characterization in cultured HeLa 
cells (fig. S5), we expressed CaMPARI in primary 
cultured rat hippocampal neurons. Whereas a 
2-s pulse of PC light (1.5 W/cm^) resulted in 
only minimal CaMPARI PC in unstimulated neu- 
rons, the same light exposure induced significant 
PC in field-stimulated neurons (Fig. 1, F to H), 
with a roughly linear correspondence between 
the frequency of field stimulation and the extent 
of CaMPARI PC (Fig. II). Stimulation of CaMPARI- 
expressing neurons without exposure to 405-nm 
light led to transient decreases of its green fluo- 
rescence (fig. S6), but with no conversion to the 
red form. We additionally depolarized neurons 
either by increasing extracellular potassium con- 
centration or via pharmacogenetic activation 
of exogenously expressed P2X channels with 
adenosine 5 "-triphosphate (ATP), both in con- 
cert with application of PC light. Cells were fixed 
with 4% paraformaldehyde (PFA) and colabeled 
with traditional antibody-staining techniques 
(Fig. 1, J and K). The green and red fluorescence 
of CaMPARI each decreased -50% upon PFA 
fixation (fig. S7). Retention of the CaMPARI PC 
signal through fixation, permeabilization, and 
multicolor antibody labeling enables integrated 
Ca^^ concentrations within individual cells to 
be directly correlated to independent proteomic 
metrics typically used for post hoc estimation 
of cell-type identification and examination of a 
wide range of biochemical events related to plas- 
ticity and other key signaling cascades. We co- 
immunolabeled fixed neurons for P2X expression 
level (fig. S8, an indication of the magnitude of 
the exogenous pharmacogenetic manipulation) 
and phosphoiylated nuclear cyclic adenosine 3",5"- 
monophosphate (cAMP)-responsive element- 
binding protein (pCREB), which increased with 
elevated intracellular calcium (Fig. IL). CaMPARI 
signal after fixation was similar to that of conven- 
tional immunofluorescence and in situ hybrid- 
ization markers, making it possible to image a 
direct measure of Ca^^ in parallel with proteomic 
markers that can only be accessed after fixation. 

We next used CaMPARI to mark active neu- 
rons in vivo, during sensory stimuli or behavior, 
in four preparations of three model organisms: 
larval zebrafish and Drosophila melanogaster 
to demonstrate whole-brain and genetically 
targeted neural ensemble mapping during free 
behavior; mouse primary visual cortex (VI) for 
large-scale post hoc confirmation of known 
distributions of direction-selective cells in a mam- 
malian brain; and adult Drosophila to highlight 
“labeled lines” of synaptically connected ensem- 
bles following naturalistic or optogenetic stim- 
ulation of sensory pathways. We chose calcium 
affinity variants (CaMPARI in mouse and adult 
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Drosophila', V398D, with about one-seventh the 
affinity for in larval and adult Drosophila', 
W391F+V398L, with about one-half the affinity 
for in larval zebrafish; fig. S3) based on per- 
formance of different GCaMP variants in these 
preparations, as well as literature estimates of 
intracellular calcium. 

To demonstrate the utility of CaMPARI for 
whole-brain neural circuit marking in larval 
zebrafish {Danio rerio) during free behavior, we 
generated stable transgenic zebrafish expressing 
CaMPARI in all neurons from the elavlS promo- 
ter {16) (Fig. 2A). After exposure of larvae (4 to 
5 days after fertilization) to 10 s of PC light (405-nm 
light-emitting diode array, 400 mW/cm^, Fig. 2B), 
confocal stacks were acquired to generate a 
cellular-resolution snapshot of the calcium state 
of all neurons in the zebrafish brain (Fig. 2C 
and movie SI). Exposure of larvae anesthetized 
with tricaine methanesulfonate (MS-222) to PC 
light resulted in a lack of PC throughout the 
brain (Fig. 2C). Freely swimming fish with no 
additional stimulation showed a large amount 
of PC in the forebrain and habenula, but very 
little in the optic tectum. Additionally, patterns 
of hindbrain activity consistent with motor out- 
put for swimming {17, 18) were often observed. 
Treatment of larvae with proconvulsive com- 
pounds such as 4-aminopyridine (4-AP), expo- 
sure to noxious heat or cold stimuli (Fig. 2C), or 
exposure to turbulent water (fig. S9) during the 
PC light pulse resulted in qualitatively different 
patterns of CaMPARI PC throughout the brain, 
consistent with permanent marking of the subset 
of neurons activated by these stimuli. CaMPARI 
signals following each stimulus were consistent 
between fish (figs. S9 and SIO) and showed clear 
cellular resolution (Fig. 2C, bottom panels). 

Next, we tracked activity in a freely moving 
organism with CaMPARI expression isolated to a 
genetically specified subset of neurons. We ex- 
pressed CaMPARI in peripheral sensory neurons 
(PSNs) of Drosophila melanogaster larvae using 
the P0163-Gal4 line {19). In third-instar larvae, 
CaMPARI fluorescence was clearly visible in PSN 
axonal projections in the ventral nerve cord (VNC) 
and could be segmented into proprioceptive, chor- 
dotonal, and nociceptive axon terminals (Fig. 2E). 
To highlight PSN responses to vibrational stimuli 
{20), we administered a single 5-s pulse of PC 
light to freely crawling larvae on an agar plate 
during presentation of a 1000-Hz tone delivered 
from a speaker below the plate (Fig. 2F). The vi- 
bration stimulus produced a twofold increase 
in green-to-red PC only within the axonal projec- 
tions of chordotonal neurons, not the axons of 
proprioceptive or nociceptive neurons (Fig. 2G). 
The extent of CaMPARI PC within chordotonal 
axons increased in a dose-dependent manner 
with increasing amplitude of the vibration stim- 
ulus, with more sensitive responses in lateral 
compared with ventromedial axon terminals 
(Fig. 2H). 

Within layer 2/3 of mouse VI, neurons respon- 
sive to different directions and orientations of 
moving bar gratings are interspersed through- 
out the tissue {21). CaMPARI was delivered to 
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Fig. 1. CaMPARI engineering and in vitro characterization. (A) Schematic of CaMPARI function. (B) 
Primary structure of CaMPARI with mutations relative to mEos2. (C) Time course of red fluorescence 
appearance during exposure of CaMPARI to PC light. Lines represent single exponential fits to the PC 
time course, n-3 measurements. All panels show means ± SEM. (D) CaMPARI PC rate as a function of 
free Ca^"^. Black line represents a sigmoidal fit. n = 3 measurements. (E) Relative fluorescence of 
CaMPARI as a function of free Ca^^ concentration. Lines are sigmoidal fits to the data, n = 3 mea- 
surements. (F and G) Fluorescence of primary rat hippocampal neurons expressing CaMPARI after a 
2-s PC light pulse without and with 80-Hz field electrode stimulation. (H) Color scale for composite 
micrographs of green and red CaMPARI fluorescence used throughout this work. (I) Quantitation of 
the red/green fluorescence ratio of neurons stimulated at different frequencies during PC light. Black 
line is a linear fit to the data, n = 10 measurements. (J) Fluorescence from two fixed, immunostained 
neurons. (K) Individual color channels from the same field of view as in (J). (L) Correlation between 
intensity of phospho-CREB stainingand CaMPARI PC for high-potassium depolarization (90 mM K^), 
controls (4 mM K^), and P2X/ATP. A range of expression levels of P2X channels was present across 
neurons, resulting in the observed spread in activation upon 100 iiM ATP stimulation. Scale bars: 50 gm 
in all panels. 
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layer 2/3 pyramidal cells with an adeno-associated 
virus 1 (AAVl)-synapsinl virus. Expression was 
bright and excluded from the nucleus (Fig. 3B). 
First, in vivo calcium imaging was performed 
through a cranial window by imaging two-photon 
excited CaMPARI fluorescence as moving grat- 
ings were presented to the contralateral eye (16) 
(Fig. 3A). This allowed generation of orientation- 
tuning maps of segmented cell bodies. After iden- 
tification of cells responsive to specific orientations 
(Fig. 3C), the visual stimulus (a single direction— 
“northwest,” preferred by cell 1 but not cell 2 in 
Fig. 3B) was replayed, and PC light was delivered 
through the cranial window in 500-ms pulses. 
After 20 PC pulses spaced 12 s apart, noticeable 


red fluorescence was present in a subset of 
cells (cell 1, but not cell 2, in Fig. 3B and movie 
S2). Neurons responsive to the “northwest” 
moving grating direction (like cell 1 in Fig. 3) 
exhibited a red/green fluorescence ratio signif- 
icantly higher than cells not responsive to that 
direction (like cell 2 in Fig. 3, fig. Sll). We re- 
located the in vivo two-photon fields of view 
within fixed, sectioned tissue using a confocal 
microscope (Fig. 3, B and E, and fig. S12). The 
relative red/green ratio of cells was maintained 
after fixation (fig. S13), and we were additionally 
able to image the CaMPARI signal over a much 
larger volume (Fig. 3F) than would have been 
easily accessible through the cranial window 


in vivo. When perfusion and fixation occurred 
24 hours after PC, marked cells could still be 
easily distinguished in fixed sections with con- 
focal microscopy (fig. S14). 

Genetic tools in Drosophila do not include 
methods for trans-synaptic circuit tracing and 
whole-brain functional mapping. Although pho- 
toactivatable GFP (22) and optogenetics in 
combination with calcium imaging (23) have 
been employed to detect potential neuronal con- 
nections, these methods require knowledge of 
presynaptic neurons and operate on a limited 
field of view. We set out to test whether CaMPARI 
could provide a more comprehensive solution. We 
affixed intact flies with pan-neuronal expression 
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Fig. 2. Response in freely moving organisms. (A) CaMPARI expression in 
the Tg[e/av/3;CaMPARI(W391F+V398L)]^^® zebrafish. Dashed black rectan- 
gle represents the region displayed in the top panels of (C). (B) Schematic 
of experimental configuration. (C) Confocal images of zebrafish larvae (4 to 
5 days after fertilization) after 10 s of PC light, applied during exposure to 
different conditions (labels above: MS-222 is tricaine, a sodium channel 
blocker: 4-AP is 4-aminopyridine, a potassium channel blocker: noxious heat 
is exposure to 45°C water; noxious cold is exposure to 4°C water). Top panels 
are maximum-intensity Z projections (MIPs) of the entire brain. Dashed gray 
box in the top left panel represents the region displayed in the bottom panels. 
Bottom panels are individual Z slices from the same fish, illustrating individual 
cells and neuropils within portions of the optic tectum (OT), cerebellum (CB), 


and hindbrain (HB). (D) Schematic of labeled peripheral sensory neurons 
(PSNs) in a Drosophila larva. Axons (green lines) project from the body wall to 
the brain and branch along tracts within the VNC. Dashed gray box represents 
the region displayed in (E). (E) Confocal images of CaMPARI fluorescence in 
a subset of neurons in the VNC, driven by P0163-Gal4. Rectangles are seg- 
mented axonal tracts. P, proprioceptive: C, chordotonal: and N, nociceptive. 
(F) Schematic of the experimental setup. (G) Quantitation of the red/green 
CaMPARI fluorescence in sensory neuron projections. Mean ± SEM from n = 
9 larvae in each condition (**P = 0.003; n.s., not significant, P > 0.4; Student’s 
t tests). (H) Dose response in chordotonal axons. Mean ± SEM from n = 6 
to 10 larvae in each condition. Scale bars: 500 gm in (A), 100 gm in (C), and 
20 iim in (E). 
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of low-affinity CaMPARI (V398D) to a phys- 
iology holder via the head cuticle and delivered 
PC light through a water-dipping objective while 
stimulating the animals with a panel of odors 
(Fig. 4A and fig. S15): (i) 3-octanol (3-Oct), which 
activates multiple glomeruli in the antennal lobe 
[AL; see {16) for a list of anatomical abbreviations 
used] {24); (ii) geosmin (Geo), an odorant that 
specifically activates the DA2 glomerulus (25); 
and (hi) phenylacetic acid (PAA), which strongly 
activates ionotropic receptor neurons projecting 
to the VL2a glomerulus {26). CaMPARI mapping 
confirmed the known neural representations 
of these odors in the antennal lobe (Fig. 4C, figs. 
S16 and S17, and movies S3 to S8). We also found 
additional, previously undescribed represen- 
tations of some odors (Fig. 4, D and E). In the 
previous Geo study (25), a specific driver, GH146- 
Gal4, was used to express GCaMP3 in a limited 
subset of cells, facilitating segmentation of live- 
cell imaging data at the cost of ensemble un- 
dersampling. We similarly saw only the DA2 
glomerulus activated when we drove CaMPARI 
expression with GH146-Gal4 (fig. S18), but ad- 
ditionally observed the DCl, VAld, VAlv, and VM7 
glomeruli (Fig. 4, D and E) with pan-neuronal 
CaMPARI. In the case of PAA, the original study 
(26) assessed odor responses using electrophysiol- 
ogy from specific Ir84a receptor neurons project- 
ing to a single glomerulus (VL2a), rather than 
sampling the entire AL. When pan-neuronal 
CaMPARI was used, VL2a was the most salient 
responder, but weaker PC of VAlv and VM7 was 
also seen. Increasing the stimulation and PC time 
increased the overall red/green ratio without 
altering the relative amount of signal in differ- 
ent glomeruli (Fig. 4F). 

Olfactory receptor neuron (ORN) responses 
likely contribute substantially to the activation 
patterns we observe in the AL with the broad 
R57C10-Gal4 driver. Although local neurons (LNs) 
likely contribute to AL labeling as well, we found 
evidence of projection neuron (PN) activation 
(fig. S19). The activation of these secondary neu- 
rons was clearer when we used the PN-specific 
GH146-Gal4 (figs. S18, S20, and S21). With GH146- 
Gal4, we also found photoconverted PN axons 
projecting to the mushroom body (MB) calyx 
(CA) and the lateral horn (LH) (figs. S20 and S21 
and movie S6), confirming that CaMPARI en- 
ables trans-synaptic circuit mapping. The labeled 
axons in LH arborize primarily ventrally (fig. S21), 
consistent with previous reports (26). Thus, 
pan-neuronal expression enables large-scale un- 
biased ensemble mapping across the brain. Co- 
registration of active cells to standardized atlases 
or the use of specific drivers to identify activ- 
ated neurons (27) could be a powerful strategy 
for mapping pathways involved in specific 
behaviors. 

We reasoned that sensory-driven circuit map- 
ping with CaMPARI could also be complemented 
by the use of optogenetic stimulation to activate 
circuits downstream of an arbitraiy cellular point 
of entry. We expressed the red light-sensitive chan- 
nelrhodopsin CsChrimson {28) in Ir84a receptor 
neurons (fig. S22) using a LexA driver while ex- 


pressing CaMPARI pan-neuronally with a Gal4 
driver. We then mapped active circuits through- 
out the brain during optogenetic stimulation of 
this single-receptor neuron type (Fig. 4G). In ad- 
dition to the glomeruli in the antennal lobe, where 
axons of ORNs and dendrites of PNs synapse 
(Fig. 4H), we observed axons of PNs projecting 
to the posterior areas of the brain (Fig. 4, 1 to L). 
One axon commissure, the medial antennal lobe 
tract (mALT), targets both CA and LH (Fig. 4, 1 
and K), whereas another commissure, the medial- 
lateral antennal lobe tract (mlALT), targets LH 
directly (Fig. 4, 1 to L). As with PAA odor-driven 
activation, photoconverted PN axons mostly 
target the ventral part of LH. In addition, we 
detected activity in the MB vertical lobe (VL, Fig. 
4, J and L) and VLP (Fig. 4, 1 and K), which are 
putative postsynaptic partners of PNs (Fig. 4J 
and fig. S23). More specifically, the a' and P' 
MB lobes were more photoconverted than a 
and p (fig. S24, E, F, G, and I compared to D), 
consistent with previous reports (29, 30). Using 
the higher-affinity CaMPARI, we attempted to 
mark circuit components further downstream 
(Fig. 4, K and L, and fig. S25). Although we found 
similar patterns of active PNs and VLP neurons 
as before (Fig. 4K), we also found other putative 
downstream circuit components such as LH 
neurons (Fig. 4L, cyan arrows), mushroom body 


output neurons (Fig. 4, J and L, and fig. S24), and 
descending neuron projections to the ventral 
nerve cord (fig. S25). Activation of deeper layers 
of the circuit suggests that CsChrimson may be 
able to drive the firing rates (and thus Ca^"^ 
levels) of sensory neurons and downstream 
neurons more potently than natural stimuli. 

In summary, CaMPARI was validated in four 
preparations in three organisms. Recording neu- 
ral circuit activity free from restraint creates op- 
portunities for studying complex behaviors such 
as social interaction, courtship, and so forth. In 
the case of direction-selective L2/3 pyramidal 
cells in mouse VI, CaMPARI labeling is consist- 
ent with results obtained from imaging of GECIs 
{12) and small-molecule dyes {31), as well as elec- 
trical recordings {21). In freely moving Drosophila 
larvae, vibration-evoked signals were restricted 
to chordotonal, but not proprioceptive or no- 
ciceptive, sensory neurons; the results show a 
functional difference in sensitivity of two dis- 
tinct subtypes of chordotonal axons to vibration. 
In adult Drosophila brains, PC during both nat- 
uralistic and optogenetic stimulation of olfactory 
sensory pathways labeled activity in the ex- 
pected OSN/PN circuit and, further, labeled PN 
projections to the LH and MB, and potential 
quaternary descending neurons in the VNC. In 
freely swimming zebrafish, activation patterns 
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Fig. 3. Response in mouse primary visual cortex. (A) Schematic of experimental setup. (B) Two- 
photon fluorescence from cortical layer 2/3 of VI after visual stimulus and PC light pulses. Two cells are 
circled and labeled for reference. (C) Calcium imaging fluorescence traces of the same two cells in 
response to different directions of drifting gratings (arrows and lines above traces). Average baseline- 
normalized green CaMPARI fluorescence {F/Fq) from five trials is in green; individual trial traces are light 
gray. Vertical gray bars represent periods of drifting grating visual stimulus. Vertical dashed lines show the 
timing of 500-ms PC light pulses during the “northwest” stimulus. (D) CaMPARI Fred//^green of cells 
according to functional properties identified by calcium imaging. “PC tuned”: cells with a significant 
(analysis of variance test, P < 0.01) calcium imaging response component to “northwest” gratings, (n = 
15): “responsive, not PC tuned”: cells with a significant response to at least one direction, but no response 
to “northwest” (n = 39): “non-responsive”: no significant response during any grating orientation (n = 
248) (***P< 0.001: n.s., not significant: P = 0.13: Mann-Whitney U tests). (E) Confocal image of fixed, sectioned 
tissue showing the same field of view as in (B). (F) Stitched composite confocal image of fixed, sectioned tissue. 
The region shown in (E) is outlined with a dashed gray box. Scale bars: 20 iim in (B) and (E), 200 iim in (F). 
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are consistent with hindbrain premotor and spi- 
nal cord motor output, as well as sensory modality- 
specific responses, which will be followed up 
experimentally. In all in vivo test cases, CaMPARI 
validates known results and offers extensions not 
otherwise possible: free movement, whole-brain 
activity mapping, a permanent signal compatible 


with fixation, and follow-up experiments such as 
targeted electrical recordings, multichannel im- 
munohistochemistry (allowing cell type identifi- 
cation), and isolation and genetic profiling of 
labeled cells. These properties facilitate “for- 
ward functional neuromics,” in which active 
neural populations are labeled during behav- 


ior, drug treatment, sensory stimulation, and so 
forth in an unbiased fashion, with subsequent 
validation and characterization of labeled cells. 

CaMPARI represents a new class of geneti- 
cally encoded indicator that enables light-gated 
integration of Ca^^ flux in vivo with a time res- 
olution of seconds to hours, specified by the user. 
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Fig. 4. Functional mapping and circuit tracing in adult 
Drosophila. (A) Schematic of experimental configuration. 

PC light is directed through an objective onto the head cuticle 
of a head-fixed adult fly during odor (C to F) or optogenetic (G 
to L) stimulation. (B) Schematic of olfactory circuitry tar- 
geted. The central brain of an adult fly is represented as a 
gray outline. ORNs [see (J6) for a list of anatomical abbre- 
viations used] (green) expressing specific receptors con- 
verge onto cognate glomeruli in the AL. PNs (red) from these 
glomeruli target the CA and LH. Putative tertiary neurons 
(cyan) then project to several areas including VL/SIR SLR 
VLP and finally to DN (orange), into the VNC. Dashed black 
box represents the area depicted in (C), (D), and (E). (C) MIPs of CaMPARI 
fluorescence from the left AL in response to indicated odors during PC. (D) MIP 
of red/green fluorescence ratios from (C). Dotted lines outline responsive 
glomeruli. (E) Maps of odor-response patterns in the AL. Each colored shape 
outlines one glomerulus from (C) and (D). Filled red glomeruli were previously 
reported in literature as responsive to that odor. Glomeruli that could be 
identified are labeled. (F) Gomparison of glomerulus-specific responses to PAA 
at different PG and stimulation durations (100 s, n = 4; 200 s, n = 4; 300 s,n = 
3). Means ± SEM. (G to L) GsGhrimson is expressed in specific ORNs while 
CaMPARI is expressed pan-neuronally. (G) Schematic showing the proposed 
connectivity of the olfactory circuit analyzed. (H) Response pattern in AL 


K LexAop-CsChrimson-mVenus ; UAS-CaMPARI ; R57C10-Gal4 
w lr84a-LexA + 



OceIJi 

1 

VL 

Posterior 

SLR 

MBONs [ 

1 

y 

^ '‘^LH 

y 

1 

1 

1 

1 

1 

FED 

y 

A 1 



ORNs' 

1 



Id 


OL 


shown with MIP; Ir84a ORNs are colabeled with mVenus. lr84a-LexA.VP16 
drives ORNs innervating ventromedial and dorsomedial glomeruli (see supple- 
mentary materials) that are photoconverted. (I) MIP of posterior sections 
showing marked PN axons targeting CA via the mALT bundle and LH through 
both the mALT and mlALT bundles. Also shown are labeled putative tertiary 
neurons in VLP. (J) MIP of more medial sections [relative to (I), same brain] 
showing the mlALT tract of PNs marked. (K and L) Experiments using high- 
affinity CaMPARI. (K) Similar to (H). (L) MIPof more medial sections [relative to 
(K), same brain] showing marked primary (ORNs, green arrows), secondary 
(PNs, red arrow), and putative tertiary (cyan arrows) neurons after optogenetic 
activation-PC. Scale bars: 25 |im in (C) and 50 |im in (H) to (L). 
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Converted signal is persistent, amenable to mul- 
titrial accumulation, and compatible with fixa- 
tion for post hoc whole-brain (or targeted-area) 
activity reconstruction. CaMPARI signal is ratio- 
metric, correcting for expression-level differences. 
The protein engineering principles applied here 
could be extended to allow permanent marking 
of cell states or analytes other than Ca^"^ by fu- 
sion of circularly permuted Eos variants to var- 
ious ligand-binding or sensor domains, similar 
to previous GFP-based fluorescence intensity 
sensors (32-34). 
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MAMMALIAN EVOLUTION 

Evolutionary development in basal 
mammaliaforms as revealed by 
a docodontan 

Zhe-Xi Luo/* Qing-Jin Meng/* Qiang Ji/ Di Liu/ Yu-Guang Zhang/ April I. Neander^ 

A new Late Jurassic docodontan shows specializations for a subterranean lifestyle. It 
is similar to extant subterranean golden moles in having reduced digit segments as 
compared to the ancestral phalangeal pattern of mammaliaforms and extant mammals. 
The reduction of digit segments can occur in mammals by fusion of the proximal and 
intermediate phalangeal precursors, a developmental process for which a gene and 
signaling network have been characterized in mouse and human. Docodontans show a 
positional shift of thoracolumbar ribs, a developmental variation that is controlled by Hox9 
and Myf5 genes in extant mammals. We argue that these morphogenetic mechanisms of 
modern mammals were operating before the rise of modern mammals, driving the 
morphological disparity in the earliest mammaliaform diversification. 


M ammaliaforms consist of modem Mam- 
malia and their fossil kin of the Mesozoic 
(1-3). We report a new Late Jurassic 
mammaliaform (Fig. 1) of the extinct 
docodontan clade that lived on Laurasia 
during the Mesozoic (2-6). It shows a reduction 
of segments in each digit and a positional shift in 
lumbar ribs, for which genetic mechanisms of 
morphogenesis have been characterized in mod- 
ern mammals. This suggests that developmental 
mechanisms played a role in the evolution of 
disparate morphologies, which are associated 
with versatile functional adaptations and paleo- 
ecological diversities in docodontans. 

Docofossor hrachydactylus, gen. et sp. nov. 
(7), is preserved in part and counterpart 
(BMNH131735A and B) (Fig. 1 and fig. SI), with 
a skull length of 22 mm, a rostmm-pelvis length 
of 90 mm, and an estimated body mass of 13 to 
17 g (7). It has a dentition of I4.Cl.P3.M3/i4.cl. 
p3.m4 and shows mesiodistally short and trans- 
versely wide upper molar crowns known as 
“zalambdodont” (7) (Fig. 2). This is typical of 
mammals foraging underground, such as marsupi- 
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al moles, golden moles, some tenrecs (8, 9), and 
Necrokstes of the extinct meridiolestidans (9, 10). 
The rostrum has a complete premaxillary in- 
temarial process and septomaxillaiy bone, which 
are primitive features in early mammaliaforms, 
including other docodontans (2-4). Its protrud- 
ing rostrum is common for fossorial mammals 

dh 12). 

Docofossor has many features indicative of 
scratch digging and subterranean living (7, 11-13), 
such as a hypertrophied olecranon process of the 
ulna (the olecranon-to-ulna length index is 47%) 
(11). It is similar to fossorial monotremes in 
possessing a sprawling posture of both limbs 
(Fig. 1) and a tibioastragalar (upper ankle) joint 
adapted for a habitually abducted foot (Fig. 3). 
In monotremes, the hindlimb shows a perma- 
nently flexed knee joint, which is constrained by 
the enlarged parafibula posterior to the knee. 
Docofossor likely had a similar sprawling pos- 
ture constraint by the parafibula (Fig. 1C) (14, 75). 
The curved tibia has a pivotal tibioastragalar 
joint in which the tibial distal malleolus ar- 
ticulates with the medial concavity of the tro- 
chleated astragalus. The malleolus serves as a 
pivot around which the astragalus and the foot 
can rotate while the concave distal tibial facet 
glides over the medial trochlear crest of the 
astragalus (Fig. 3, F to H). This enables mono- 
tremes to habitually abduct their feet relative 
to the limb and allows a wider range of foot 
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Fig. 1. Mammaliaform 
D. brachydactylus. 

(A) Holotype part 
(BMNH131735A) (the 
counterpart is in 
fig. S2). (B) Exposed 
skull of BMNH131735A. 

(C) Reconstruction of 
Docofossor as fosso- 
rial, with a sprawling 
posture and capable of 
digging (lateral view). 

(D) and (E) Antero- 
lateral views of limbs. 




Fig. 2. Dentition of Docofossor. (A and B) Left upper dentition in occlusal and buccal views (M2 re- 
constructed from right). (C) Left lower teeth and mandible in buccal view. (D and E) Lower right teeth and 
mandible in occlusal and lingual view. More stereo images of teeth and mandibles are shown in fig. S3. 


abduction (Fig. 3, F to H). Docofossor has a 
similar trochleate astragalus with a medial con- 
cavity (Fig. 3, A to D), thus a habitually abducted 


foot capable of v^de range of hyperextension 
and eversion, for monotreme-like pedal posture 
and movement (Fig. 3). 


Docofossor has a reduced number of phalan- 
ges for hands (2-2-2-2-2) and feet (1-2-2-2-2), with 
only two segments in each digit of digit rays 2 
to 5. Shovel-like terminal phalanges are dorso- 
ventrally low but horizontally broad. Its dorsal 
surface projects posteriorly to form the bony stop 
against hyperextension, as in digging mammals 
(11). The flexor tubercle is massive and has a 
keyhole pit for the insertion of the flexor muscle. 
A single (not paired) enlarged sesamoid bone for 
the digital flexor muscle is preserved at the distal 
interphalangeal joints. Each digit ray has a single 
“middle” phalanx, and the metacarpal is shortened 
and wide, with combined length shorter than the 
terminal phalanx and strong hingelike joints, all 
typical of scratch-digging mammals {11, 12). 

Extant mammals have conserved identities of 
the thoracic versus lumbar vertebrae {16). The 
thoraco-lumbar (T-L) transition, as marked by 
vertebral features and by the presence or absence 
of ribs, has functional implications for locomo- 
tion (77) and respiration {18). The T-L transition 
is usually distinctive, with several lumbars lack- 
ing any ribs. These differences across the T-L tran- 
sition are controlled by homeobox genes Hox9 
and HoxlO {19). Ribs are developed from the lat- 
eral plate of mesoderm and are influenced both 
by these Hox genes and by myogenic factor {Myf) 
5 and 6 dovm-cascade in the gene network from 
Hox genes {20). 

The newly discovered Agilodocodon {21) and 
Docofossor reveal a positional shift of the lumbar 
ribs through the T-L transition among docodon- 
tans. Docofossor has relatively long lumbar ribs on 
three of the last four dorsal vertebrae (“lumbars”). 
Similarly, another docodont, Castorocauda, has 
lumbar ribs on the posterior dorsal vertebrae 
(“lumbars”) up to the ultimate dorsal (= dorsal 
22) and a gradational reduction of rib lengths on 
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dorsals 18 to 22 (5). In Q,ontrdiSt,Agilodocodon has 
no lumbar ribs on the four ultimate dorsal (‘lum- 
bar”) vertebrae, with a precipitous decrease in rib 
lengths on dorsals 16 to 18 and the posteriormost 
rib on dorsal 18 (fig. S2). Thus the T-L transition 
is more distinctive mAgilodocodon. 

Eutriconodontans and spalacotherioids of crown 
mammals are already known for similar patterns 
in shifting T-L boundaries and in shifting lumbar 
rib positions {14, 22). However, docodontans are 
a stem clade outside the crown mammals (2-7). 
The positional shifting of lumbar ribs and T-L 
vertebral identities in docodontans show that 
the morphogenesis associated with the gene net- 
work of Hox 9-10 and Myf 5-6 in extant mammals 
{19, 20) is operative not only in crown mammals 
but also in stem mammaliaforms, before the rise 
of modem mammals. 

The phalangeal formula (number of segments 
in each of digit rays 1 to 5) of Docofossor is re- 
duced to 2-2-2-2-2 for the manus and to 1-2-2-2-2 
for the pes, which is different from the 2-3-3-3-3 
formula for both hands and feet of the majority 
of mammaliaforms (including Castorocauda and 
Agilodocodon) and of advanced premammalian 
cynodonts {23-25). Golden moles of Africa (Chrys- 


ochloridae), as an exception to most extant mam- 
mals, have reduced phalanges, and their manual 
formula is 2-2-1-2-0 for Eremitalpa and Chrys- 
ospalax, and the pedal formula is 2-2-2-2-2 for 
Eremitalpa and Chrysochloris {25, 26), as re- 
confirmed by our analyses of computed tomog- 
raphy (CT) scans. Phalangeal reduction in golden 
moles is a striking parallel to phalangeal reduc- 
tion in Docofossor within the docodont group 
(Figs. 3 and 4). 

In synapsid evolution, the reduction of seg- 
ments in a digit ray can occur by a loss of one or 
more ancestral segments, the fusion of ancestrally 
separate segments, or both {24). During develop- 
ment in extant mammals, segmental reduction 
can occur by two related processes: (i) a mesen- 
chymal condensate precursor to a cartilaginous 
segment fails to develop, as in human brachy- 
dactyly (“short-fingeredness”) type A (BDAl), in 
which hypoplasia (underdevelopment) or apla- 
sia (absence) of the intermediate phalangeal seg- 
ment occurs across all five digits (27); or (ii) 
failure of the interzone— a region of flattened 
cells in higher density within the digit ray that is 
the precursor of the future joint. The interzone 
separates the presumptive segments during the 


stage of condensation of phalangeal templates. 
Phalangeal reduction by an interzone failure is 
well characterized in symphalangism (“fused 
segments”) of human and mouse, in which the 
proximo-intermediate phalangeal joint is not de- 
veloped because of the failure of the interzone to 
initiate or not to go through full differentiation 
(27, 28). 

African golden moles, with a subterranean 
lifestyle and superb digging adaptation, are an 
example of brachydactyly as a result of sympha- 
langism {26) (Fig. 4) (7): The early mesenchymal 
condensates are segmented in a digit ray. Digit 
rays 2 to 4 show three segments, starting var- 
iously at 9-, 10-, and 12-mm embryonic stages. 
However, the presumptive proximo-intermediate 
phalangeal joint fails to develop in later stages 
in these digits. As a result, the mesenchymal con- 
densate segments become fused in later devel- 
opment, although developmental timing (stages) 
of this fusion can be different among digit rays. 
At the 19-mm stage, segments of digit rays 2 to 4 
have reduced by fusing together the phalangeal 
segments that had been separate earlier {26). 

The gene and signaling network for normal 
phalangeal development, which can also cause 


Fig. 3. Foot of Docofossor. (A) 

Reconstructed right foot (dorsal 
view) of phalangeal formula 

1- 2-2-2-2. (B and C) Right 
astragalus in the dorsomedial 
and dorsal views (stereo images 
from CT). (D) Fore-aft limb rota- 
tion to the trochleate ankle joint, 
with the tibial malleolus as a pivot 
in the medial concavity of the 
astragalus. (E) Chrysochloris right 
foot (dorsal view) with formula 

2- 2-2-2-2 by fusion of the proximo- 
intermediate phalanges of digit 
rays 2 to 5 (25). (F) and (G) 
Ornithorhynchus disassembled 
astragalus and calcaneus 
(posteromedial view). (H) 

Pivotal limb-foot rotation by the 
tibial malleolus in the medial 
concavity of the astragalus and 
tibio-astragalar joint as a gliding 
track. (I and J) Tachyglossus 
astragalus and calcaneus 
(posteromedial view). The 
pivotal tibio-astragalar joint 

of docodontans is similar to 
those of monotremes. 
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phalangeal reduction by mutation or other defects, 
has been characterized by developmental exper- 
iments on model mice and from genetic studies 
of abnormal development of phalanges in hu- 
mans (27-30). The joint development is influ- 
enced by bone morphogenetic protein (BMP) 2 
and growth and differentiation factor (GDF) 5, 
which positively regulate cell proliferation (29), 
and by Notch signaling limiting overgrowth of 
differentiated chondrocytes in the interzone or 
presumptive joint (27-30). The latter phenotype 


structures become defective when mutations oc- 
cur in genes encoding BMPs and GDF-5 (29)', 
genes encoding the Wnt/p-catenin pathway and 
Indian Hedgehog (IHH) (27) that are linked with 
BMPs and GDFs; the gene(s) for the transforming 
growth factor- p (TGF-P) receptor; TGF-p signal- 
ing is essential for interphalangeal joint develop- 
ment and operates upstream of BMPs/GDFs 
(28)', or in NOG, which is antagonistic to BMPs, 
among other genes and signaling pathways. 
The shortening of phalangeal condensates also 


leads to the failure of joint formation in distal 
segments (27). 

These morphogenetic mechanisms also underlie 
the phalangeal embryogenesis for mammals as a 
whole, including the segmental reduction within 
each digit ray in golden moles. The single “mid- 
dle” phalanx of Docofossor (Fig. 4) is similar to 
the single “middle” segment fused from the proxi- 
mal and intermediate phalangeal precursors in 
the embryogenesis of golden moles (25, 26) (fig. 
S4). Given the similarity of golden moles and 


metacarpal and 
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Fig. 4. Evolutionaty development and morphological disparity of auto- 
pods of docodontans. (A and B) Fusion of phalangeal segments in the 
development of the golden mole Eremitalpa: aniagen of phalanges, metacar- 
pal, and distal carpal of digit ray 3 are distinctive at the 9-mm embryonic 
stage and become fused in the 12-mm embryonic stage and adult (26). 
Shown is an adult from CT scanning of Carnegie Museum (CM) specimen 
93155. (C) The digging “claw” of Eremitalpa is formed by a hypertrophied 
distal phalange and the fusion of intermediate and proximal phalanges. 


metacarpal, and the capitate (25, 26) (verified by CT scanning). (D) Cynodont 
Kayentatherium (23), (E) Ornithorhynchus, and (F) docodont Agilodocodon of 
ancestral phalangeal formula 2-3-3-3-3. (G) Docofossor with shortened 
phalanges of 2-2-2-2-2, convergent to some extant fossorial mammals. 
Embryogenesis of phalangeal joints in extant mammals is patterned by BMPs, 
GDF-5, and TGF-p (27-30). Mutation of these genes and signaling pathways 
can now be correlated with the shortening of digits (brachydactyly) by 
hypoplasia or aplasia of the middle phalanges in Mus knockouts (27). 
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Docofossor, we hypothesize that the gene and 
signaling pathways that control the morphogen- 
esis of phalanges in extant mammals probably 
caused similar changes in Docofossor, significant- 
ly expanding the known morphological disparity 
of phalanges among the earliest mammaliaforms 
(Fig. 4), well before the rise of modem mammals. 

The shortening and widening of digits and 
phalangeal reduction are major adaptations for 
digging and a subterranean lifestyle (11, 13, 2S), 
which are common in mammals and clearly con- 
vey major evolutionary advantage (11, 12). The 
reduction of phalangeal segments by morpho- 
genesis provides a mechanistic underpinning 
for the extensive convergent evolution of fossorial 
adaptation in fossil and extant mammaliaforms. 

Evolutionary biologist Leigh Van Valen famous- 
ly stated that “evolution is the control of develop- 
ment by ecology.” The new docodontan fossil 
reported here represents such an example of how 
developmental morphogenesis, which has become 
increasingly better understood in modem mam- 
mals, can be a driving mechanism of the evo- 
lution of lumbar vertebrae without ribs and the 
shortening and fusion of digital segments. Both 
features are significant for locomotion and eco- 
logical specialization during the earliest diversi- 
fication of mammaliaforms. 
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MAMMALIAN EVOLUTION 


D ocodontans are extinct early mammalia- 
forms without living descendants, but they 
were widespread and abundant in Laura- 
sia during the Mesozoic (1-9). Mammalia- 
forms bear key evidence of the earliest 
evolution of mammalian characteristics (4, 10-12) 
and of the morphological disparity and func- 
tional diversification during the early evolution 
of mammaliaforms (4, 13-16). We report a new 
docodontan from the Middle Jurassic of China 
that has many skeletal features adapted for climb- 
ing (Fig. 1), different from the digging and swim- 
ming adaptations seen in other docodontans. 
Further, it shows dental features for an herbiv- 
orous diet including gum or sap, distinctive from 
the omnivorous to faunivorous diets of other 
docodontans (1). This species expands the eco- 
logical diversity known for the docodontan group 
and provides further evidence that adaptive di- 
versification at the species level, as seen in the 
evolution of many extant mammal clades, oc- 
curred also within the earliest mammaliaform 
clades (17). 
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Agilodocodon scansorius, gen. et sp. nov. 
(18), is preserved in part and counterpart 
(BMNH001138A and B) (Fig. 1 and fig. SI), with 
a flattened skull (length at 27 mm) and a skel- 
eton (estimated head-tail length at 140 mm). Its 
body mass is ~27 g, based on scaling of body 
mass to lengths of jaw and limb bones, but it 
may weigh up to 40 g (18). It has a dentition 
of incisors, canine premolars, and molars of 
I4.Cl.P6.M4/i4.cl.p6.m4, and shows a typical di- 
phyodont tooth replacement (Fig. 2 and figs. 
S2 and S3). A. scansorius has many docodontan 
synapomorphies (18) but differs from other 
docodontans in three unique molar characters: 
an enlarged prestylar wing anterior to cusp A, 
two stylar shelf ridges that extend from the apex 
of cusp A, and two ridges that extend from the 
apex of cusp C to the labial cingulum. On the basis 
of molar features, Agilodocodon is more similar to 
Hvlegotheriurn (6), Krusatodon, Borealestes (3), 
and Simpsonodon (1) of the Middle Jurassic of 
Siberia and Europe. Although other docodontans 
have single-rooted, peglike lower incisors (1, 5, 9), 
Agilodocodon is unique in having incisors with 
incipiently divided roots. Their crowns are spat- 
ulate, mesiodistally broa4 and have the shape of 
a broad spade on a wide base (Fig. 2). The incisors 
are convex on the labial side and concave on the 
lingual side with a low median ridge and a lin- 
gual cingulid (Fig. 2, G to H, and fig. S4) (18). 


An arboreal docodont from the 
Jurassic and m ammal i af orm 
ecological diversification 

Qing-Jin Meng,^ Qiang Ji,^ Yu-Guang Zhang, ^ Di Liu,^ 

David M. Grossnickle,^ Zhe-Xi Luo^’^* 

A new docodontan mammaliaform from the Middle Jurassic of China has skeletal features 
for climbing and dental characters indicative of an omnivorous diet that included plant sap. 
This fossil expands the range of known locomotor adaptations in docodontans to include 
climbing, in addition to digging and swimming. It further shows that some docodontans 
had a diet with a substantial herbivorous component, distinctive from the faunivorous diets 
previously reported in other members of this clade. This reveals a greater ecological 
diversity in an early mammaliaform clade at a more fundamental taxonomic level not 
only between major clades as previously thought. 
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This type of incisor is similar to the incisors 
of some extant New World monkeys. Specif- 
ically, the spade-shaped and spatulate lower 
incisors are known from small marmosets, 
spider monkeys (Ateles), and howler monkeys 
(Aloutta). The small anthropoid primates with 
this type of incisor use the incisors to gnaw 
into the bark of trees to feed on exudates, such 
as gum and sap (19). This type of incisor suggests 
that Agilodocodon had a similar component 
in its diet. This is the earliest-known evidence 
of gumnivorous feeding in mammaliaforms, 
which had evolved separately among docodonts. 


and in convergence to those of extant primates. 
The crest pattern of the upper molars is analo- 
gous to those of galagid and some lorisid pri- 
mates that have a mixed diet of insects, other 
small animals, fruits, tree gums, and sap. The well- 
developed crests on the molars bear resemblance 
to those of lemurids, which have fruigivorous- 
insectivorous diets. 

Most docodontans have molars adapted for 
omnivorous-faunivorous diets (1) although Cas- 
torocauda has piscivorous (fish-eating) anterior 
molars and is large enough to feed on vertebrate 
prey (4). Agilodocodon diverged from previously 


knovm docodontans in having an omnivorous 
diet with a significantly greater amount of plant 
food. The gumnivorous adaptation of its anterior 
lower teeth shows the versatility of feeding and 
dietary ecology at the species level of this mam- 
maliaform clade in Middle Jurassic, akin to the 
dietary diversification of kuehneotheriid and 
morganucodont mammaliaforms during the Early 
Jurassic (16) and multituberculate mammals dur- 
ing the Late Cretaceous (20). 

We hypothesize that Agilodocodon was likely 
a fully arboreal mammaliaform, or at least a 
scansorial mammaliaform, on the basis of its 
many skeletal features adapted for climbing 
and on its distinction from other Jurassic doco- 
dontans (4, 21) and mammaliaforms in the ver- 
tebral column, limbs, and digits (Figs. 3 and 4) 
( 10 , 22 ). 

The shape and proportion of the hand, foot, 
and finger bones, as well as the homy claws, are 
informative about habits and the preference of 
substrate. InAgilodocodon, these bones are slender 
and gracile (Fig. 4 and figs. S4 to S6). The pha- 
langeal slenderness ratio (PSR: lengths/widths x 
100) is 328% for all proximal phalanges and 210% 
for all intermediate phalanges (23). These values 
are well vdthin the range of arboreal diproto- 
dontian marsupials, but they are above (for prox- 
imal phalanges) or near the upper limit (for 
intermediate phalanges) of terrestrial diproto- 
dontians (23). 

In Agilodocodon, the phalanges are elongate 
relative to the metacarpals. The phalangeal index 
[PI: (combined length of proximal + intermediate 
phalanges)/Gength of metacarpal) x lOO] is 137% 
for manual ray 3, typical of primates and arboreal 
marsupials, carnivorans, and rodents (24) but 
well above the range of this index for terrestrial 
mammals. Length ratios of the intermediate pha- 
lanx, proximal phalanx, and metacarpal of digit 
ray 3 are similar to those of primates (arboreal). 
The intrinsic proportions of hand bones of 
Agilodocodon (fig. S7) are closer in value to the 
arboreal than to the terrestrial marsupials and 
closer to the arboreal than to the terrestrial 
rodents (24). 

In phalangeal morphology (Fig. 4), Agilodoco- 
don is more similar to the arboreal flying lemur 
(Dermoptera: Cynocephalus) and the lemur (and 
other primates) than to the scansorial opossum 
(Marsupialia: Didelphis) and the terrestrial 
hedgehog (Lipotyphla: Erinaceus). By contrast, 
fossorial and subterranean docodontans, with 
their short proximal and intermediate phalanges 
but massive terminal phalanx are more compa- 
rable to the fossorial echidna (Monotremata: 
Tachyglossus) and the subterranean golden 
mole (Afrosoricia: Eremitalpa). 

The terminal manual phalanges are bilaterally 
compressed, dorsoventrally deep v^th an arched 
dorsal margin (Fig. 3 and fig. S6). The flexor 
tubercles appear to be massive on the bilaterally 
compressed phalangeal body, which indicates 
the insertion of well-developed digit flexor mus- 
cles. The shape and proportion of phalanges are 
typical of arboreal or scansorial rodents, and they 
differ from terrestrial rodents (25, 26). Compared 



Fig. 1. Skeleton of the new docodont mammaliaform Agiiodocodon scansorius. (A) Reconstruc- 
tion of Agilodocodon as an arboreal mammaliaform. (B) Outline and (C) photo of the holotype main 
part of Beijing Museum of Natural History (BMNH) 001138A: counterpart BMNH001138B shown in 
fig. SI (18). 
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with other docodontans, Agilodocodon is differ- 
ent from Haldanodon that has short, broad, and 
massive terminal phalanges for burrowing (or a 
semiaquatic lifestyle) {21), from the specialized 
swimming Castorocauda that has elongate, curved, 
but shallow, terminal phalanges (4), and from a 
new fossorial docodontan (BMNH131735) that 
has broad, flattened, and shovel-like terminal 
phalanges for digging in a subterranean niche 
(Fig. 4). The outline of horny claws is exquisitely 
preserved in organic residue (in distinctive white 
color) for several manual and pedal claws in 
Agilodocodon. Pedal horny claw 4 is twice as 
long as the terminal phalanx, laterally com- 
pressed, and strongly arcuate (Fig. 3). The horny 
claw of Agilodocodon is typical of mammals liv- 
ing on the tree or in the bushes. Agilodocodon, 
Haldanodon, Castorocauda, and the new doco- 
dont (BMNH131765) indicate a wide range of hab- 
its as evidenced by their diverse types of terminal 
phalangeal bones and claws. 

The hypothesis that Agilodocodon is a climb- 
ing mammaliaform is supported further by fea- 
tures of the distal humerus and the proximal 
part of the ulna, the ankle joint, and the lumbar 


and proximal caudal vertebrae {18). The distal 
humerus is gracile and narrow, in contrast to 
the robust and broad distal humeri associated 
with large muscles for burrowing and/or swim- 
ming in Castorocauda and Haldanodon (Fig. 3, 
H and I) {4, 21). The humerus has a small ect- 
epicondylar shelf for the supinator and extensor 
muscles to rotate the radius relative to the ulna 
and for extensor muscles of the antebrachium 
and the wrist. The entepicondyle for flexor mus- 
cles of the wrist is also better developed in 
Agilodocodon than in the mammaliaform Mor- 
ganucodon {27) (Fig. 3G). In didelphid marsu- 
pials {28) and tree shrew species {29), the 
supinator shelf and the entepicondyle tend to 
be better developed in species with better climb- 
ing capability than the terrestrial species of the 
same clades. 

Agilodocodon has a greater range of ankle 
mobility than other mammaliaforms. The foot 
has a side-to-side juxtaposition of the astrag- 
alus and the calcaneus (Fig. 3) with a short, ven- 
trally turned calcaneal tuber (fig. S7), which is 
also plesiomorphic in mammaliaforms Morga- 
nucodon and Megaconus {10, 27). However, the 


distal parts of the calcaneus and the astragalus 
(defined as distal to their sustentacular facets) 
are more elongate in Agilodocodon (Fig. 3 and 
fig. S7) than in Morganucodon and Megaconus. 
This allows a wider movement of the distal 
end of the astragalus and the navicular, rela- 
tive to the calcaneus and the cuboid, for better 
eversion, inversion, and flexion or adduction of 
pedes and greater abduction of the medial ped- 
al digit(s) relative to the lateral digit(s) than in 
Megaconus {10). 

The vertebral function of Agilodocodon is 
unique compared with mammaliaforms of the 
same Jurassic fauna that are terrestrial, fossorial, 
and swimming. The last four lumbar vertebrae 
(‘‘posterior dorsals”) have no lumbar ribs. The pos- 
terior dorsal vertebrae show a distinctive thoraco- 
lumbar transition, and a more distinctive transition 
corresponds to wider range of movement of the 
posterior vertebral column {30), as expected for a 
scansorial or arboreal mammaliaform living in 
bushes or on trees. MoveoYev, Agilodocodon differs 
from other mammaliaforms in the same paleo- 
ecological community {Castorocauda, Megaconus, 
and Pseudotribos) that show lumbar ribs and 



Fig. 2. Mandible and dentition of Agilodocodon. (A) and (B) Labial and occlusal views of left upper teeth in outline. (C) Left upper P6-M4 by stereoimages 
rendered from CT scans. (D) Occlusal view of lower teeth. (E) Labial view of left mandible and teeth. (F) Lingual view of mandible (image rendered from CT 
scans, omitting middle ear elements). (G and H) Lingual and labial views of spatulate, spade-shaped incisors. (I) Reconstructed mandible and middle ear 
bones, additional details in fig. S4 {18). (J) Occlusal view of left lower p6-m4, stereo images rendered from OT. Abbreviations: ep6-erupting p6: em4-erupting m4. 
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gradational thoracolumbar transition (10, 22). 
Castorocauda also has partially overlapping and 
widened ribs of posterior dorsal vertebrae up to 
the pelvis. The distinction of the thoracolumbar 
transition suggests a major difference in vertebral 
function between^4g^odocodo72 and Castorocauda, 
Megaconus and Pseudotribos. The first eight cau- 
dals of Agilodocodon are dorsoventrally flattened 
and transversely widened with prominent trans- 
verse processes, which indicate well-developed 


vertebral extensor (epaxial) muscles. The prox- 
imal seven caudals have synovial surfaces in 
zygapophyses and were mobile relative to each 
other. The highly mobile and well-muscularized 
tail, by itself, does not necessarily indicate an 
arboreal habit but is consistent with the arboreal 
locomotor adaptation (30). 

Agilodocodon is the first arboreal species 
among docodontans, and it differs in substrate 
preference from other terrestrial, digging, and 


swimming forms, adding to the diversity of life 
styles of docodonts (Fig. 4). Its dietary adapta- 
tion includes a gumnivorous component, which 
is unique among mammaliaforms in the Juras- 
sic fauna. The great range of ecological diversity 
of the docodontans provides further evidence 
that adaptive diversification at the species level, 
typical of many species of extant mammalian 
clades, also occurred within the earliest mam- 
maliaforms (17). 



Fig. 3. Comparative morphology of forelimb of Agilodocodon. (A) and (B) Humerus (right, distal part) in posterior and anterior views. (C) Right 
forelimb and manus (humerus in posterior view; manus in palmar view). (D) The intermediate and terminal phalanges of digit ray 3 in medial, ventral, 
lateral, and dorsal views. (E) Reconstructed right manus in palmar view. (F) Left pes, dorsal view. (G) Morganucodon from (26). (H) Distal humerus of 
Castorocauda. (I) Haldanodon from (20). 
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Fig. 4. Ecomorphological diversification of docodont mammaliaforms. (A) Phylogeny of docodontans— analysis presented in supplementary 
materials (18). (B) Agilodocodon as a tree-living mammaliaform. (C) Manus of Agilodocodon. (D) Castorocauda as a swimming mammaliaform. (E) 
Fossorial and subterranean docodont (BMNH131735) from Late Jurassic. (F) Phalangeal proportion of Agilodocodon compared with those of 
Cynocephalus (Dermoptera, arboreal). Lemur (Primates, arboreal), Didelphis [Didelphidae, scansorial according to (28)], Erinaceus (Eulipotyphia, 
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MARINE POLLUTION 

Plastic waste inputs from land into 
the ocean 

Jenna R. Jambeck/* Roland Geyer,^ Chris Wilcox,^ Theodore R. Siegler,^ 

Miriam Perryman/ Anthony Andrady/ Ramani Narayan/ Kara Lavender Law^ 

Plastic debris in the marine environment is widely documented, but the quantity of plastic 
entering the ocean from waste generated on land is unknown. By linking worldwide 
data on solid waste, population density, and economic status, we estimated the mass 
of land-based plastic waste entering the ocean. We calculate that 275 million metric 
tons (MT) of plastic waste was generated in 192 coastal countries in 2010, with 4.8 to 
12.7 million MT entering the ocean. Population size and the quality of waste management 
systems largely determine which countries contribute the greatest mass of uncaptured 
waste available to become plastic marine debris. Without waste management 
infrastructure improvements, the cumulative quantity of plastic waste available to enter 
the ocean from land is predicted to increase by an order of magnitude by 2025. 


R eports of plastic pollution in the ocean 
first appeared in the scientific literature 
in the early 1970s, yet more than 40 years 
later, no rigorous estimates exist of the 
amount and origin of plastic debris en- 
tering the marine environment. In 1975, the es- 
timated annual flux of litter of all materials to the 
ocean was 6.4 million tons [5.8 million metric 
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tons (MT)], based only on discharges from ocean 
vessels, military operations, and ship casualties 
(7). The discharge of plastic from at-sea vessels 
has since been banned (2), but losses still occur. 
It is widely cited that 80% of marine debris or- 
iginates from land; however, this figure is not 
well substantiated and does not inform the total 
mass of debris entering the marine environment 
from land-based sources. 

Plastics have become increasingly dominant 
in the consumer marketplace since their com- 
mercial development in the 1930s and 1940s. 
Global plastic resin production reached 288 
million MT in 2012 (3), a 620% increase since 
1975. The largest market sector for plastic res- 
ins is packaging (3); that is, materials designed 
for immediate disposal. In 1960, plastics made 
up less than 1% of municipal solid waste by mass 
in the United States (4); by 2000, this proportion 
increased by an order of magnitude. By 2005, 
plastic made up at least 10% of solid waste by 


mass in 58% (61 out of 105) of countries with 
available data (5). 

Plastics in the marine environment are of 
increasing concern because of their persistence 
and effects on the oceans, wildlife, and, poten- 
tially, humans (6). Plastic debris occurs on coast- 
lines, in Arctic sea ice, at the sea surface, and 
on the sea floor (7, 8). Weathering of plastic 
debris causes fragmentation into particles that 
even small marine invertebrates may ingest (9). 
Its small size also renders this debris untraceable 
to its source and extremely difficult to remove 
from open ocean environments, suggesting that 
the most effective mitigation strategies must re- 
duce inputs. 

We estimated the annual input of plastic to the 
ocean from waste generated by coastal popula- 
tions worldwide. We defined mismanaged waste 
as material that is either littered or inadequately 
disposed. Inadequately disposed waste is not for- 
mally managed and includes disposal in dumps 
or open, uncontrolled landfills, where it is not 
fully contained. Mismanaged waste could even- 
tually enter the ocean via inland waterways, 
wastewater outflows, and transport by wind or 
tides. Estimates of the mass of plastic waste car- 
ried by particular waterways range from <<1 kg 
per day (Hilo, HI) to 4.2 MT (4200 kg) per day 
(Danube River) {10, 11). Because of their depen- 
dence on local watershed characteristics, these 
results cannot be easily extrapolated to a global 
scale. 

Here we present a framework to calculate the 
amount of mismanaged plastic waste generated 
annually by populations living within 50 km of a 
coast worldwide that can potentially enter the 
ocean as marine debris. For each of 192 coastal 
countries with at least 100 permanent residents 
that border the Atlantic, Pacific, and Indian 
oceans and the Mediterranean and Black seas, 
the framework includes: (i) the mass of waste 
generated per capita annually; (ii) the percent- 
age of waste that is plastic; and (iii) the percent- 
age of plastic waste that is mismanaged and. 
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therefore, has the potential to enter the ocean 
as marine debris ( 12 ) (data SI). By applying a 
range of conversion rates from mismanaged 


waste to marine debris, we estimated the mass 
of plastic waste entering the ocean from each 
country in 2010, used population growth data 


( 13 ) to project the increase in mass to 2025, and 
predicted growth in the percentage of waste 
that is plastic. Lacking information on future 



Fig. 1. Global map with each country shaded according to the estimated mass of mismanaged plastic waste [millions of metric tons (MT)] generated 
in 2010 by populations living within 50 km of the coast. We considered 192 countries. Countries not included in the study are shaded white. 


Table 1. Waste estimates for 2010 for the top 20 countries ranked by mass of mismanaged plastic waste (in units of millions of metric tons per year). 

Econ classif., economic classification: HIC, high income: UMI, upper middle income: LMI, lower middle income: LI, low income (World Bank definitions based 
on 2010 Gross National Income). Mismanaged waste is the sum of inadequately managed waste plus 2% littering. Total mismanaged plastic waste is 
calculated for populations within 50 km of the coast in the 192 countries considered, pop., population: gen., generation: ppd, person per day: MMT, million 
metric tons. 


Rank 

Country 

Econ. 

classif. 

Coastal pop. 
[millions] 

Waste gen. 
rate 

[kg/ppd] 

% plastic 
waste 

% mismanaged 
waste 

Mismanaged 

plastic 

waste 

[MMT/year] 

% of total 
mismanaged 
plastic 
waste 

Plastic 

marine 

debris 

[MMT/year] 

1 

China 

UMI 

262.9 

1.10 

11 

76 

8.82 

27.7 

1.32-3.53 

2 

Indonesia 

LMI 

187.2 

0.52 

11 

83 

3.22 

10.1 

0.48-1.29 

3 

Philippines 

LMI 

83.4 

0.5 

15 

83 

1.88 

5.9 

0.28-0.75 

4 

Vietnam 

LMI 

55.9 

0.79 

13 

88 

1.83 

5.8 

0.28-0.73 

5 

Sri Lanka 

LMI 

14.6 

5.1 

7 

84 

1.59 

5.0 

0.24-0.64 

6 

Thailand 

UMI 

26.0 

1.2 

12 

75 

1.03 

3.2 

0.15-0.41 

7 

Egypt 

LMI 

21.8 

1.37 

13 

69 

0.97 

3.0 

0.15-0.39 

8 

Malaysia 

UMI 

22.9 

1.52 

13 

57 

0.94 

2.9 

0.14-0.37 

9 

Nigeria 

LMI 

27.5 

0.79 

13 

83 

0.85 

2.7 

0.13-0.34 

10 

Bangladesh 

LI 

70.9 

0.43 

8 

89 

0.79 

2.5 

0.12-0.31 

11 

South Africa 

UMI 

12.9 

2.0 

12 

56 

0.63 

2.0 

0.09-0.25 

12 

India 

LMI 

187.5 

0.34 

3 

87 

0.60 

1.9 

0.09-0.24 

13 

Algeria 

UMI 

16.6 

1.2 

12 

60 

0.52 

1.6 

0.08-0.21 

14 

Turkey 

UMI 

34.0 

1.77 

12 

18 

0.49 

1.5 

0.07-0.19 

15 

Pakistan 

LMI 

14.6 

0.79 

13 

88 

0.48 

1.5 

0.07-0.19 

16 

Brazil 

UMI 

74.7 

1.03 

16 

11 

0.47 

1.5 

0.07-0.19 

17 

Burma 

LI 

19.0 

0.44 

17 

89 

0.46 

1.4 

0.07-0.18 

18* 

Morocco 

LMI 

17.3 

1.46 

5 

68 

0.31 

1.0 

0.05-0.12 

19 

North Korea 

LI 

17.3 

0.6 

9 

90 

0.30 

1.0 

0.05-0.12 

20 

United States 

HIC 

112.9 

2.58 

13 

2 

0.28 

0.9 

0.04-0.11 


*lf considered collectively, coastal European Union countries (23 total) would rank eighteenth on the list 
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global infrastructure development, the projec- 
tion represents a business-as-usual scenario. 

We estimate that 2.5 billion MT of municipal 
solid waste was generated in 2010 by 6.4 billion 
people living in 192 coastal countries (93% of 
the global population). This estimate is broadly 
consistent with an estimated 1.3 billion MT of 
waste generated by 3 billion people in urban 
centers globally (5). Approximately 11% (275 mil- 
lion MT) of the waste generated by the total 
population of these 192 countries is plastic. We 
expect plastic waste to roughly track plastic 
resin production (270 million MT in 2010) (3), 
with differences resulting from the time lag in 
disposal of durable goods (lifetime of years to 
decades), for example. Scaling by the population 
living within 50 km of the coast (those likely to 
generate most of the waste becoming marine 
debris), we estimate that 99.5 million MT of 
plastic waste was generated in coastal regions 
in 2010. Of this, 31.9 million MT were classified 
as mismanaged and an estimated 4.8 to 12.7 mil- 
lion MT entered the ocean in 2010, equivalent to 
1.7 to 4.6% of the total plastic waste generated in 
those countries. 

Our estimate of plastic waste entering the 
ocean is one to three orders of magnitude greater 
than the reported mass of floating plastic debris 
in high-concentration ocean gyres and also glob- 
ally {14-17). Although these ocean estimates rep- 
resent only plastics that are buoyant in seawater 
(mainly polyethylene and polypropylene), in 
2010 those resins accounted for 53% of plastic 
production in North America and 66% of plas- 
tic in the U.S. waste stream (4, 18). Because no 
global estimates exist for other sources of plastic 
into the ocean (e.g., losses from fishing activities 
or at-sea vessels, or input from natural disasters), 
we do not know what fraction of total plastic 
input our land-based waste estimate represents. 



Fig. 2. Estimated mass of mismanaged plas- 
tic waste (millions of metric tons) input to 
the ocean by populations living within 50 km 
of a coast in 192 countries, plotted as a cumu- 
lative sum from 2010 to 2025. Estimates reflect 
assumed conversion rates of mismanaged plastic 
waste to marine debris (high, 40%: mid, 25%: low, 
15%). Error bars were generated using mean and 
standard error from the predictive models for mis- 
managed waste fraction and percent plastic in the 
waste stream (12). 


Our framework was designed to compute, from 
the best-available data, an order-of-magnitude 
estimate of the amount of mismanaged plastic 
waste potentially entering the ocean worldwide. 
It is also a useful tool to evaluate the factors de- 
termining the largest sources of mismanaged 
plastic waste. The amount of mismanaged plastic 
waste generated by the coastal population of a 
single country ranges from 1.1 MT to 8.8 million MT 
per year, with the top 20 countries’ mismanaged 
plastic waste encompassing 83% of the total in 
2010 (Fig. 1 and Table 1). Total annual waste gen- 
eration is mostly a function of population size, 
with the top waste-producing countries having 
some of the largest coastal populations. How- 
ever, the percentage of mismanaged waste is also 
important when assessing the largest contribu- 
tors of waste that is available to enter the en- 
vironment. Sixteen of the top 20 producers are 
middle-income countries, where fast economic 
growth is probably occurring but waste man- 
agement infrastructure is lacking (the average 
mismanaged waste fraction is 68%). Only two of 
the top 20 countries have mismanaged fractions 
<15%; here, even a relatively low mismanaged 
rate results in a large mass of mismanaged plas- 
tic waste because of large coastal populations 
and, especially in the United States, high per 
capita waste generation. 

Assuming no waste management infrastruc- 
ture improvements, the cumulative quantity of 
plastic waste available to enter the marine en- 
vironment from land is predicted to increase 
by an order of magnitude by 2025 (Fig. 2 and 
table SI). The predicted geographic distribu- 
tion of mismanaged plastic waste in 2025 does 
not change substantially, although the disparity 
between developing and industrialized countries 
grows (table S2). For example, mismanaged plas- 
tic waste in the United States increases by 22%, 
whereas in the top five countries it more than 
doubles. The increase in these middle-income 
countries results from population growth, waste 
generation rates for 2025 that are consistent 
with economic growth (5), and a projected in- 
crease in plastic in the waste stream. 

The analytical framework can also be used to 
evaluate potential mitigation strategies. For ex- 
ample, if the fraction of mismanaged waste were 
reduced by 50% (i.e., a 50% increase in adequate 
disposal of waste) in the 20 top-ranked countries, 
the mass of mismanaged plastic waste would 
decrease 41% by 2025. This falls to 34% if the 
reduction is only applied to the top 10 countries 
and to 26% if applied to the top 5. To achieve a 
75% reduction in the mass of mismanaged plas- 
tic waste, waste management would have to be 
improved by 85% in the 35 top-ranked countries. 
This strategy would require substantial infrastruc- 
ture investment primarily in low- and middle- 
income countries. 

Alternatively, reduced waste generation and 
plastic use would also decrease the amount of 
mismanaged plastic waste. If per capita waste 
generation were reduced to the 2010 average 
(1.7 kg/day) in the 91 coastal countries that 
exceed it, and the percent plastic in the waste 


stream were capped at 11% (the 192-country av- 
erage in 2010), a 26% decrease would be achieved 
by 2025. This strategy would target higher-income 
countries and might require smaller global in- 
vestments. With a combined strategy, in which 
total waste management is achieved (0% mis- 
managed waste) in the 10 top-ranked countries 
and plastic waste generation is capped as de- 
scribed above, a 77% reduction could be real- 
ized, reducing the annual input of plastic waste 
to the ocean to 2.4 to 6.4 million MT by 2025 
(table S3). 

Sources of uncertainty in our estimates re- 
sult from the relatively few measurements of 
waste generation, characterization, collection, 
and disposal, especially outside of urban centers. 
Even where data were available, methodologies 
were not always consistent, and some activities 
were not accounted for, such as illegal dumping 
(even in high-income countries) and ad hoc re- 
cycling or other informal waste collection (espe- 
cially in low-income countries). In addition, we 
did not address international import and export 
of waste, which would affect national estimates 
but not global totals. Although national estimates 
are somewhat sensitive to the model predicting 
the percentage of mismanaged waste, the global 
estimate and ranking of top countries are not. 
The long-term projections are also sensitive to 
the model predicting growth of plastic in the 
waste stream; historical growth may not be a 
good indicator of future trends (12). The inclu- 
sion of the economic cost of implementation, 
as well as socio-cultural, environmental, and 
other factors that affect infrastructure devel- 
opment or behavioral change, would improve 
the evaluation of mitigation strategies (19). 

We will not reach a global “peak waste” be- 
fore 2100 (20). Our waste will continue to grow 
with increased population and increased per 
capita consumption associated with economic 
growth, especially in urban areas and developing 
African countries (see supplementary materials). 
Historically, waste management by burying or 
burning waste was sufficient for inert or bio- 
degradable waste, but the rapid growth of syn- 
thetic plastics in the waste stream requires a 
paradigm shift. Long-term solutions will likely 
include waste reduction and “downstream” waste 
management strategies such as expanded re- 
covery systems and extended producer respon- 
sibility (21, 22). Improving waste management 
infrastructure in developing countries is para- 
mount and will require substantial resources and 
time. While such infrastructure is being devel- 
oped, industrialized countries can take imme- 
diate action by reducing waste and curbing the 
growth of single-use plastics. 
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VIRAL REPLICATION 

Structural basis for RNA replication 
by tbe hepatitis C virus polymerase 
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Nucleotide analog inhibitors have shown clinical success in the treatment of hepatitis 
C virus (HCV) infection, despite an incomplete mechanistic understanding of NS5B, the 
viral RNA-dependent RNA polymerase. Here we study the details of HCV RNA replication 
by determining crystal structures of stalled polymerase ternary complexes with enzymes, 
RNA templates, RNA primers, incoming nucleotides, and catalytic metal ions during both 
primed initiation and elongation of RNA synthesis. Our analysis revealed that highly 
conserved active-site residues in NS5B position the primer for in-line attack on the 
incoming nucleotide. A p loop and a C-terminal membrane-anchoring linker occlude the 
active-site cavity in the apo state, retract in the primed initiation assembly to enforce 
replication of the HCV genome from the 3' terminus, and vacate the active-site cavity 
during elongation. We investigated the incorporation of nucleotide analog inhibitors, 
including the clinically active metabolite formed by sofosbuvir, to elucidate key molecular 
interactions in the active site. 


H epatitis C vims (HCV) is a positive-sense, 
single-stranded RNA virus of the family 
Flaviviridae and genus Hepacivirns and 
is the cause of hepatitis C in humans (i). 
Long-term infection with HCV can lead to 
end-stage liver disease, including hepatocellular 
carcinoma and cirrhosis, making hepatitis C 
the leading cause of liver transplantation in the 
United States {2). Direct-acting antiviral drugs 
were approved in 2011, but they exhibited limited 
efficacy and had the potential for adverse side 
effects {3). The catalytic core of the viral replica- 
tion complex, the NS5B RNA-dependent RNA 
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polymerase (RdRp), supports a staggering rate of 
viral production, estimated to be 1.3 x lO^^ vi- 
rions produced per day in each infected patient 
(4). Because the NS5B polymerase active site is 
highly conserved, nucleotide analog inhibitors 
offer advantages over other classes of HCV dmgs, 
including activity across different viral genotypes 
and a high barrier to the development of resist- 
ance (5, 6). The nucleotide prodmg sofosbuvir 
was recently approved for combination treatment 
of chronic HCV (7, 8). 

One substantial obstacle for the rapid discov- 
ery of effective nucleotide-based dmgs for HCV 
was the lack of molecular detail concerning sub- 
strate recognition during replication. NS5B con- 
tains several noncanonical polymerase elements, 
including a C-terminal membrane anchoring tail 
and a thumb domain (3-loop insertion {9-11), that 
are implicated in RNA synthesis initiation {12). 


To gain insight into the mechanism of HCV RNA 
replication and its inhibition by nucleotide ana- 
log inhibitors, we determined atomic-resolution 
ternary stmctures of NS5B in both primed ini- 
tiation and elongation states. 

Because traditional approaches failed to yield 
ternary complexes (see the supplementary mate- 
rials), we prepared multiple stalled enzyme-RNA- 
nucleotide ternary complex stmctures containing 
several designed features. First, we used NS5B 
from the JFH-1 genotype 2a isolate of HCV, which 
is extraordinarily efficient at RNA synthesis {13). 
Second, we exploited a conformational stabiliza- 
tion strategy that had been developed for stmc- 
tural analysis of G protein-coupled receptors {14). 
We hypothesized that a triple resistance NS5B 
mutant isolated under selective pressure of a 
guanosine analog inhibitor that exhibits 1.5 times 
the initiation activity of the wild type {15) might 
stabilize a specific conformational state along 
the initiation pathway. Indeed, this triple mutant 
exhibits a substantial stmctural rearrangement 
of the polymerase {15), which is consistent with 
the stmctural rearrangement observed in binary 
complexes of a (3-loop deletion mutant bound to 
primer-template RNA {16). The triple mutant was 
able to incorporate native and nucleotide analog 
inhibitors with the RNA samples used in stmcture 
determination (fig. SI). The use of nucleotide 
diphosphate substrates rather than nucleotide 
triphosphates (fig. S2) generates stalled polymerase 
complexes in a catalytically relevant conforma- 
tion. Ternary complexes could be obtained only 
with Mn^"^, which lowers the Michaelis constant 
(Am) of the initiating nucleotide {17) and increases 
the activity of NS5B 20-fold relative to Mg^^ {18), 
and only with a nucleotide/Mn^Vdouble-stranded 
RNA ratio of 1.0/0.6/0.2. These approaches de- 
signed to stabilize the incoming nucleotide al- 
lowed for soaking experiments targeting several 
distinct assemblies. 

Hepatitis C vims NS5B initiates RNA synthesis 
by a primer-independent mechanism. Two slow 
steps in the catalytic pathway have been identified, 
including the formation of an initial dinucleotide 
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primer and the transition from the dinucleotide- 
primed state to a rapid, processive elongation 
state {19, 20). We obtained crystal structures of 
ternary complexes containing NS5B, two Mn^^ 


ions, an RNA template, and a dinucleotide prim- 
er by soaking nucleic acid and manganese ions 
into a previously elucidated apo crystal form (i5) 
(see supplementary materials). Stalled ternary 


structures were obtained either with a 5 -pGG RNA 
primer and an incoming adenosine diphosphate 
(ADP) (2.2 A resolution), cytidine diphosphate 
(CDP) (2.5 A), or uridine diphosphate (UDP) (2.0 A) 



5 -G • C-3' 
3 -G • C 
ppU - A 
U-5' 


template 


Thumb 


-1 

0 

+1 




HCV 
primed 
initiation 


Fig. 1. NS5B primed initiation assembly. (A) Overall structure of the stalled 
NS5B ternary primed initiation complex. The protein is represented by ribbons 
and colored by subdomain (pink, fingers: light blue, palm; pale green, thumb), 
with the p loop highlighted in yellow and the position of the last visible residue 
at the C terminus labeled “C.” The RNA template (5'-UACC: cyan carbons), 
5'-pGG dinucleotide primer (magenta carbons), and incoming UDP nucleotide 
(green carbons) are represented by sticks and colored according to atom type 
(red, oxygen: blue, nitrogen: orange, phosphate). The two catalytic Mn^"^ ions 


are shown as purple spheres. (B) (p6 RdRp de novo initiation assembly, with the 
priming nucleotide colored by atom type with brown carbons, the incoming 
nucleotide colored by atom type with magenta carbons, and an active-site loop 
shown in yellow. (C) Primed initiation assembly of HGV, which is one catalytic 
step after the de novo initiation assembly. Thus, the dinucleotide primer is 
colored as in (B) after catalysis, with the next incoming nucleotide colored by 
atom type with green carbons. The 2|Fol - \Fc\ electron density map is con- 
toured at lo and superimposed on the refined ligand and p-loop atoms. 
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Fig. 2. Recognition of the incoming nucleotide. (A) Stereoscopic view of the 
NS5B active site during primed initiation. Select protein atoms are represented by 
sticks and are colored by atom type with gray carbons, except for the p-loop 
residues, which are highlighted in yellow. RNA bases are labeled according to 
standard polymerase numbering conventions. Protein-ligand hydrogen bonds are 
shown as gray dashed lines, whereas base-pair hydrogen bonds are shown as red 
dashed lines. The proposed path of in-line attack by the 3'-hydroxyl on the a 
phosphate of the incoming nucleotide is illustrated by a green dashed line. (B) 
Glose-up view of the RNA template binding site. (C) Gomparison of the 3' end of 
RNA primers, metal ions, and incoming nucleotides from the Norwalk virus 


polymerase ternary complex containing cytidine triphosphate (GTP) as the in- 
coming nucleotide [PDB ID: 3BSO (23)] and the NS5B ternary complex containing 
UDP as the incoming nucleotide. The Norwalk structure atoms are colored ac- 
cording to atom type with yellow carbons and gold metal ions, whereas the NS5B 
ternary complex atoms are colored by atom type with green carbons and purple 
metal ions. (D) Gommon chemical mechanism of polymerases (24). (E) Molecular 
mechanism for recognizing ribonucleotide substrates. Protein atoms of the apo 
enzyme are colored with gray carbons, whereas protein atoms of the substrate 
complex are colored with yellow carbons. Dashed lines represent the hydrogen 
bonding network formed upon binding to an incoming ribonucleotide. 


772 13 FEBRUARY 2015 • VOL 347 ISSUE 6223 


sciencemag.org SCIENCE 


RESEARCH \ REPORTS 


(Fig. 1) or with a 5'-pCC RNA primer and an 
incoming UDP (2.15 A) or guanosine diphos- 
phate (GDP) (2.8 A) (fig. S3 and tables SI to S3). 
These results show that primed assemblies form 
via both purine and pyrimidine dinucleotide 
primers, with all possible natural ribonucleotides 
as incoming substrates. 

The 5 -untranslated region of the HCV genome 
contains an internal ribosomal entry site, ne- 


cessitating replication at the exact 3 ' end of the 
viral genome before copyback of the (-) strand 
into a (+) strand. The NS5B thumb domain (3- 
loop insertion has been proposed to position the 
3' terminus of the genomic template during 
initiation ( 12 ), yet the (3 loop and the C-terminal 
membrane-anchoring linker appear in a con- 
formation too deep v^thin the active site to be 
compatible with binding to the RNA template 


and incoming nucleotides (9-11). In the five 
primed initiation ternary assemblies presented 
here, the fingertips and thumb domains have 
undergone substantial rearrangements to accom- 
modate the nucleic acid (fig. S4). Furthermore, 
the (3 loop retracted 5 A relative to several apo 
HCV polymerase genotype 2a structures (13, 15), 
providing space for RNA replication initia- 
tion (Fig. lA) (fig. S4). These structures reveal 



Fig. 3. NS5B elongation assembly and 2' modified nucleotide inhibitor 
recognition. (A) Overall structure of the stalled NS5B ternary elongation 
complex. The protein is represented by ribbons and is colored and labeled 
as in Fig. lA. The sequence and position of the truncated p loop (16) are 
highlighted in yellow. The self-complementary RNA is depicted with the 
template strand in cyan and the primer strand in magenta and numbered 
according to convention: residues lacking electron density are depicted in 
gray.The incoming UDP nucleotide is represented in green.The two catalytic 
Mn^^ ions are shown as purple spheres. (B) Close-up view of the active site. 
Substrates are colored as in (A), and select protein residues are represented 


by sticks, colored by beige carbons, and labeled accordingly. The hydrogen 
bonding network involved in 2'-hydroxyl recognition of the incoming nu- 
cleotide is indicated with dashed lines. (C to E) Close-in views comparing 
the active sites with (C) UDP (gray carbons), (D) 2'-CH/2'-CH3-UDP (yellow 
carbons), or (E) 2'-F/2'-CH3-UDP (diphosphate metabolite of sofosbuvir; 
brown carbons). The hydrogen bond networks involved in recognizing the 
2'-hydroxyl of ribonucleotide substrates are shown by dashed lines. Binding of 
2'-F/2'-CH3-UDP reveals a disruption in the normal hydrogen bonding pattern 
observed for natural nucleotide substrates and 2'-CH/2'-CH3-containing 
analogs. 
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Fig. 4. Model of HCV replication by NS5B. Schematic of representative 
steps during RNA synthesis by HCV NS5B RdRp. In the apo form, a portion 
of the RNA binding groove is occluded by both the C terminus (blue line) and 
the p loop (yellow). During de novo initiation, the RNA template and in- 
coming and priming nucleotides enter the active site. Catalysis results in 
the formation of an initial dinucleotide primer (first slow step). The next 


nucleotide is incorporated into the dinucleotide primed initiation assembly. 
De novo initiation and primed initiation are often referred to collectively as 
“initiation,” although they are distinct states. Further conformational changes 
result in movement of the p loop and C terminus out of the RNA binding 
groove (second slow step), allowing the enzyme to transition into a pro- 
cessive elongation state. 
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molecular details of a platform for RNA synthesis 
in which the tip of the (3 loop now buttresses 
the end of the short RNA duplex. A similar 
global arrangement has been observed previ- 
ously in the de novo initiation assembly of RNA 
bacteriophage (p6 RdRp (21) (Fig. IB), which illus- 
trates the catalytic event preceding the primed 
initiation state of HCV (Fig. 1C). These (3-loop 
interactions appear critical for setting the regis- 
ter to ensure that the polymerase initiates tran- 
scription at the 3' end of the viral genome. In 
general, the (3-loop residues exhibit increased 
temperature factors and contain weaker electron 
density compared with the apo enzyme, indi- 
cating that the (3 loop starts to become disordered 
during primed initiation (fig. S5). In addition to 
retraction of the (3 loop, the C terminus vacated 
the active-site cavity now occupied by nucleic acid 
and appeared disordered beyond residue T552 
(22) in the primed assemblies. These movements 
generate space to accommodate only two Watson- 
Crick pairs upstream of the incoming nucleotide, 
suggesting that further conformational changes 
are required to accommodate additional phos- 
photransfer and translocation events. This in- 
cludes, at a minimum, an opening of the thumb 
domain via reorientation of the (3 loop. Thus, 
these structures demonstrate the polymerase as- 
sembly before the second slow step in RNA repli- 
cation (19). 

Mutational analysis of NS5B revealed R386 
of the primer grip motif to be important for 
dinucleotide-initiated RNA synthesis (12), and 
the primed-state assemblies show that both R386 
and R394 of the primer grip helix form salt 
bridges with the 5'-phosphate of the dinucleotide 
primer (Fig. 2A). The conserved catalytic residues 
D220, D318, and D319 coordinate the two catalyt- 
ic Mn^^ ions, which in turn coordinate the a and 
(3 phosphates of the incoming nucleotide. Con- 
served basic residues R48 and R158 coordinate 
the a and (3 phosphates opposite the metal ions. 
The incoming nucleotide forms a Watson-Crick 
interaction with the pairing residue of the tem- 
plate strand, which packs against conserved hy- 
drophobic residues 1160 and Y162 (Fig. 2B). The 
3'-hydroxyl of the dinucleotide primer forms an 
inner-sphere coordination with a catalytic metal 
ion in an in-line conformation with the scissile 
bond of the incoming nucleotide, nearly identical 
to that observed for the Norwalk virus RdRp (23) 
(Fig. 2C). Thus, the nucleotide diphosphates ex- 
hibit enzymatically competent conformations con- 
sistent with the common polymerase mechanism 
(24) (Fig. 2D). 

In HIV-1 reverse transcriptase, Y115 provides 
specificity for deoxynucleotide triphosphates by 
serving as a steric gate to prevent the binding of 
ribonucleotide triphosphates (rNTPs) (23), and it 
was predicted that the structurally homologous 
residue in HCV, conserved D225, would be in- 
volved in recognition of the 2'-hydroxyl of incom- 
ing rNTPs (9-11). In the NS5B ternary complexes, 
the main chain of conserved S282 flips, allowing 
its side chain to swing out and hydrogen bond 
with the 2'-hydroxyl of the substrate and the car- 
boxylic acid of D225, which moves away from the 


nucleotide substrate during binding (Fig. 2E). 
The 2'-hydroxyl of the incoming ribonucleotide 
also forms a direct hydrogen bond with the side 
chain amine of N291 on the opposite face of the 
ribose ring. This network of hydrogen bonds, to- 
gether with complementary base-pairing to the 
template, provides the structural basis for recog- 
nition of the correct ribonucleotide substrate. 

Crystal structures of a (3-loop deletion con- 
struct of the HCV NS5B polymerase were solved 
as apo (2.5 A resolution) or via soaking (see 
supplementary materials and methods) with a 
symmetrical RNA primer-template pair (16) and 
an incoming UDP (2.8 A), CDP (2.75 A), ADP 
(2.7 A), or GDP (2.9 A) (Fig. 3, figs. S6 to S8, and 
tables S4 to S6). These ternary complexes prob- 
ably represent the highly processive elongation 
phase of viral genome replication after the tran- 
sition from the primed state in the second slow 
step of polymerization (19). These high-resolution 
elongation state structures were obtained via 
soaking into the same crystal form as the triple- 
mutant structures with the intact (3 loop but 
could only be obtained with a construct contain- 
ing both the triple mutant (13) and the (3-loop 
deletion (16) (see supplementary materials). 
Overall, there is excellent overlap between the 
catalytic residues, the 3' end of the primer, and 
the incoming nucleotide when comparing the 
elongation complexes with the primed initiation 
assemblies, including the same in-line confor- 
mation of the 3'-hydroxyl of the primer with the 
scissile bond. The thumb domain moved away 
from the palm and fingers domains by an addi- 
tional 1.5 A for similar Ca atoms, demonstrating 
a slightly more relaxed state of the polymerase 
during elongation. In addition, the C-terminal 
residues downstream of A534 have evacuated the 
RNA binding groove and become disordered, pre- 
venting overlap with the template strand. Thus, 
these structures provide further evidence for con- 
certed movements of the (3 loop, the thumb do- 
main, and the C terminus once RNA elongation 
begins. Moreover, they provide the structural basis 
for the hypothesis that these elements provide 
a “swinging gate” that allows the polymerase to 
initiate at the terminus of the RNA genome and 
then transition to a processive elongation state, 
thereby replicating the complete genome (12). 

The crystal structures presented here lead us 
to propose a model of the structural events in- 
volved in HCV genome replication (Fig. 4). At the 
outset, the (3 loop and the C-terminal membrane- 
anchoring linker are buried within the encircled 
active-site cavity. In the first of two slow steps in 
HCV RNA replication (19), the 3' end of the viral 
RNA template and the incoming nucleotides 
enter the active site, possibly with accompanying 
conformational changes, and the initial phosphoryl 
transfer step generates a dinucleotide primer. 
This de novo initiation step immediately precedes 
the primed initiation assembly captured here. At 
this early stage, the complex remains unstable, 
which may account for the observed large quan- 
tity of two- to four-nucleotide-long abortive tran- 
scripts (26, 27). As the dinucleotide primer is 
extended by another one to three nucleotides, the 


build-up of tension displaces the (3 loop and the 
remaining C-terminal residues, further opening 
the cavity and allowing the RNA duplex to exit 
during the second slow step in replication (19). 
With both the (3-hairpin loop and the C terminus 
expelled from the active-site cavity, the poly- 
merase transitions into the highly processive elon- 
gation mode also captured here. 

By using an extensive hydrogen bond network 
to recognize the 2'-hydroxyl of the incoming nu- 
cleotide (Fig. 3C), HCV NS5B displays stringent 
selectivity for ribonucleotides. Consequently, 2'- 
deoxyribonucleotide chain terminators such as 
azidothymidine are ineffective against HCV, whereas 
2 -modified nucleotides are effective HCV antivirals 
(6). The nucleotide analog inhibitor sofosbuvir 
is a 2'-F-2'-C-methyluridine monophosphate pro- 
drug (28-30) and is approved for the treatment 
of chronic HCV infection (7, 8). Efficacy of chain- 
terminating nucleotide analogs requires viral 
RdRps to recognize and successfully incorporate 
the inhibitors into the growing RNA strand. To gain 
insight into the molecular details of 2'-modified 
nucleotide analog recognition, we determined 
elongation-phase temaiy complexes of both 2 -OH/ 
2'-CH3-UDP and 2'-F/2'-CH3-UDP at 2.65 and 2.9 A 
resolution, respectively (table S7). The stalled ter- 
nary complex with 2'-OH/2 -CH3-UDP as the in- 
coming nucleotide was essentially identical to the 
UDP-bound elongation assembly, with S282 under- 
going the same conformational change to interact 
directly with the 2'-hydroxyl group and hydrogen 
bond with D225 (Fig. 3D). Although the addition 
of the 2 -C-methyl group of the inhibitor places it 
within 3.1 A of the S282 Oy, previous biochemical 
studies using 2'-C-methyl nucleotides reveal that 
these analogs are readily incorporated into the 
growing chain with values approaching those 
of the natural ribonucleotide substrates (31). In 
contrast, the trapped elongation assembly con- 
taining 2'-F/2'-CH3-UDP (i.e., sofosbuvir diphos- 
phate) as the incoming nucleotide reveals that 
the hydrogen bonding network is disrupted (Fig. 
3E). D225 is oriented away from the incoming 
nucleotide, but S282 remains in the same con- 
formation as in the apo enzyme. The loss of the 
hydrogen bonding network involving S282 re- 
sults in a substantially higher Xm for 2 -F/2'-CH3- 
modified nucleotides. Nevertheless, recognition 
of the 2'-F by N291 and Watson-Crick pairing 
with the template allow sofosbuvir to form the 
in-line conformation necessary for incorporation 
into the growing chain, thereby promoting non- 
obligate chain termination. Key contacts formed 
by S282 with the incoming nucleotide and the 
surrounding environment give insight into the 
in vitro selection of a threonine as a potential 
resistance mutation to some 2 -CH3-modified nu- 
cleotides (32, 33), although S282^T282 has been 
infrequently observed in the clinic (34). In par- 
ticular, a steric clash between the T282 side chain 
and the 2'-CH3 would be predicted based on the 
structure determined with 2 -OH/2 -CH3-UDP. 

The results presented here define the structural 
requirements for HCV genomic replication from 
primed initiation to elongation and demonstrate 
the structural basis for inhibitor recognition. The 
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primed initiation state explains much of the 
known biochemistry behind the slow steps in 
the enzymatic pathway and highlights the dif- 
ferences between Flaviviridm RdRps and other 
polymerases. These structural and mechanistic 
differences have been exploited for the develop- 
ment of HCV nucleotide therapeutics that fea- 
ture pangenomic activity and a high barrier to 
the development of drug resistance. Thus, they 
may provide an avenue for the development of 
therapeutics against related viruses. 

REFERENCES AND NOTES 

1. R. Bartenschlager, V. Lohmann, J. Gen. Virol. 81 , 1631-1648 
( 2000 ). 

2. W. R. Kim et al., Am. J. Transplant. 14 (suppl. 1), 69-96 
(2014). 

3. F. Poordad, D. Dieterich, J. Viral Hepat. 19 , 449-464 (2012). 

4. A. U. Neumann et al., Science 282 , 103-107 (1998). 

5. S. Le Pogam et al., J. Antimicrob. Chemother. 61 , 1205-1216 
(2008). 

6. M. J. Sofia, Adv. Pharmacol. 67, 39-73 (2013). 

7. E. J. Gane et al., N. Engl. J. Med. 368, 34-44 (2013). 

8. R. S. Koff, Aliment. Pharmacol. Then 39, 478-487 (2014). 

9. H. Ago et al.. Structure 7, 1417-1426 (1999). 

10. S. Bressanelli et al., Proc. Natl. Acad. Sci. U.S.A. 96 , 
13034-13039 (1999). 

11. C. A. Lesburg et al., Nat. Struct. Biol. 6, 937-943 (1999). 

12. Z. Hong et al.. Virology 285 , 6-11 (2001). 


13. P. Simister et al., J. Virol. 83, 11926-11939 (2009). 

14. R. Abrol, S. K. Kim, J. K. Bray, B. Trzaskowski, W. A. Goddard III, 
Methods Enzymol. 520 , 31-48 (2013). 

15. A. M. Lam et al., Antimicrob. Agents Chemother. 58 , 
6861-6869 (2014). 

16. R. T. Mosley et al., J. Virol. 86, 6503-6511 (2012). 

17. G. T. Ranjith-Kumar et al., J. Virol. 76, 12513-12525 (2002). 

18. G. Luo et al., J. Virol. 74 , 851-863 (2000). 

19. D. Harrus et al., J. Biol. Chem. 285, 32906-32918 (2010). 

20. N. Scrima et al., J. Virol. 86, 7107-7117 (2012). 

21. S. J. Butcher, J. M. Grimes, E. V. Makeyev, D. H. Bamford, 

D. 1. Stuart, Nature 410 , 235-240 (2001). 

22. Single-letter abbreviations for the amino acid residues are as 
follows: A, Ala; C, Cys; D, Asp; E, Glu; E, Phe; G, Gly; H, His; 

I, He; K, Lys; L, Leu; M, Met; N, Asn; P, Pro; Q, Gin; R, Arg; 

S, Sen T, Thr; V, Val; W, Trp; and Y, Tyr. 

23. D. E. Zamyatkin et al., J. Biol. Chem. 283 , 7705-7712 
(2008). 

24. T. A. Steitz, Nature 391, 231-232 (1998). 

25. H. Huang, R. Ghopra, G. L. Verdine, S. G. Harrison, Science 
282 , 1669-1675 (1998). 

26. H. Dutartre, J. Boretto, J. G. Guillemot, B. Ganard, J. Biol. Chem. 
280 , 6359-6368 (2005). 

27. W. Zhong et al., J. Virol. 74 , 9134-9143 (2000). 

28. A. M. Lam et al., Antimicrob. Agents Chemother. 56 , 
3359-3368 (2012). 

29. E. Murakami et al., J. Biol. Chem. 285 , 34337-34347 
( 2010 ). 

30. M. J. Sofia et al., J. Med. Chem. 53, 7202-7218 (2010). 

31. H. Dutartre, G. Bussetta, J. Boretto, B. Ganard, Antimicrob. 
Agents Chemother. 50 , 4161-4169 (2006). 


32. S. S. Carroll et al., J. Biol. Chem. 278 , 11979-11984 
(2003). 

33. A. M. Lam etal., Antimicrob. Agents Chemother. 54 , 3187-3196 
( 2010 ). 

34. E. S. Svarovskaia et al., Clin. Infect. Dis. 59, 1666-1674 
(2014). 

ACKNOWLEDGMENTS 

We thank D. Smith at the Advanced Photon Source, Argonne 
National Labs, Life Sciences Collaborative Access Team (LS-CAT), 
for assistance in data collection, as well as our co-workers from 
Pharmasset, Beryllium, and Gilead Sciences, who have participated 
on the HCV polymerase collaboration. Coordinates and structure 
factors have been deposited with the Protein Data Bank under 
accession codes 4WT9, 4WTA, 4WTC, 4WTD, 4WTE, 4WTE, 4WTG, 
4WTI, 4WTJ, 4WTK, 4WTL, and 4WTM. Expression plasmids are 
freely available from the authors. Gilead Sciences and the authors 
(T.E.E. and E.M.) have filed international patent application number 
PCT/US2013/021130, which relates to crystal structures of HCV 
polymerase complexes and their methods of use. 

SUPPLEMENTARY MATERIALS 

www.sciencemag.org/content/347/6223/771/suppl/DCl 

Materials and Methods 

Supplementary Text 

Eigs. SI to S8 

Tables SI to S7 

References (35-43) 

25 July 2014; accepted 18 December 2014 
10.1126/science.l259210 


TIGHT JUNCTIONS 

Structural insight into tight junction 
disassembly by Clostridium 
peijringens enterotoxin 

Yasunori Saitoh/’^* Hiroshi Suzuki/*t Kazutoshi Tani/* Kouki Nishikawa/ 

Katsumasa Irie/’^ Yuki Ogura,^ Atsushi Tamura,^ 

Sachiko Tsukita,^ Yoshinori Fujiyoshi^’^t 

The C-terminal region of Clostridium perfringens enterotoxin (C-CPE) can bind to specific 
claudins, resulting in the disintegration of tight junctions (TJs) and an increase in the 
paracellular permeability across epithelial cell sheets. Here we present the structure of 
mammalian claudin-19 in complex with C-CPE at 3.7 A resolution. The structure shows that 
C-CPE forms extensive hydrophobic and hydrophilic interactions with the two extracellular 
segments of claudin-19. The claudin-19/C-CPE complex shows no density of a short 
extracellular helix that is critical for claudins to assemble into TJ strands. The helix 
displacement may thus underlie C-CPE-mediated disassembly of TJs. 


I nfection with Chstridium perfringens type A 
by eating contaminated food is a common 
cause of food poisoning in humans and ani- 
mals. In the intestines, this bacterium pro- 
duces Clostridium perfringens enterotoxins 
(CPEs) that trigger foodbome illness (i). Upon 
binding to their receptor, the complexes aggre- 
gate on the intestinal cell surface and form a 
large oligomer that inserts into the membrane 
and forms an ion pathway. The resulting Ca^^ 
influx triggers cell death {2, 3). The receptors 
for CPE, initially named CPE-R and RVP-1 (4, 5), 
were later recognized as claudin-4 and -3, re- 
spectively (d), based on their sequence simi- 


larity with claudin-1 and -2, known constituents 
of cell-to-cell tight junctional complexes (7, 8). 
The carboxyl-terminal half of CPE (C-CPE) me- 
diates the interaction with specific claudins 
{9, 10), which reversibly modulates the para- 
cellular permeability of tight junctions (TJs), 
whereas its amino-terminal half is responsible 
for pore formation and thus for cellular cyto- 
toxicity {11). 

Claudins have a common structural topology 
consisting of four transmembrane (TM) segments; 
a large first extracellular segment (ECSl), which 
contains the claudin consensus motif; and a 
shorter second extracellular segment (ECS2) 


{12-14). The adhesion and polymerization prop- 
erties of claudins enable them to form linear 
polymers, called TJ strands, which connect ad- 
jacent cells and form the structural backbone of 
TJs (75). TJs serve mainly as barriers that re- 
strict the diffusion of solutes through intercellu- 
lar spaces in epithelial and endothelial cell sheets 
{16), thus separating internal tissue compart- 
ments from external environments to maintain 
the homeostasis of our bodies {8). 

To understand the structural basis for how 
C-CPE recognizes specific claudins, we expressed 
all mouse claudin subtypes in Sf9 insect cells and 
assessed their capacity to bind C-CPE by using 
fluorescent-detection size-exclusion chromatog- 
raphy (FSEC) (77). Mouse claudin-19 showed con- 
siderable affinity for C-CPE (fig. SIA) {18). When 
expressed in a mammalian epithelial-like cell 
line, mouse claudin-19 formed TJs in the plasma 
membranes of cell-to-cell contact regions (Fig. lA). 
Although a previous study reported that a syn- 
thetic ECS2 peptide of claudin-19 had no affinity 
for CPE (9), a 24-hour incubation with C-CPE 
resulted in a significant delocalization of the 
claudin-19 signal away from the junctional bor- 
ders (Fig. lA), suggesting that binding of C-CPE 
causes claudin-19 to dissociate from TJs. The 
disruption of TJs by incubating cells expressing 
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Cldn19 ZO-1 merge(Cldn19/Z0-1) 


Fig. 1. Effect of C-CPE on TJs formed by claudin-19 and overall structure of 
mCldnl9 in complex with C-CPE. (A) Immunofluorescence microscopy of SF-7 
epithelial-like cells stably expressing mouse claudin-19. Confluent SF-7 cells plated 
on filters were incubated for 24 hours without (control) or with 100 iig/ml C-CPE 
(+C-CPE) and then stained with antibodies to claudin-19 and ZO-1. Scale bar, 

10 |im. (B) Structure of mCldnl9cryst in complex with C-CPE shown in ribbon 
representation and viewed parallel to the membrane. The color of mCldnl9cryst changes from the N terminus (blue) to the C terminus (red), and C-CPE is 
colored green. Dashed lines indicate disordered regions. The gray bars suggest the membrane boundaries of the outer (Ext.) and inner (Cyt.) leaflets. 


Fig. 2. Interactions of 
mCldnl9cryst with C-CPE. (A) 

Close-up view of the extracellular 
domains of mCldnl9cryst (ribbon 
representation) and C-CPE (sur- 
face representation). Colors are 
the same as in Fig. 1. Side chains 
of residues in conserved claudin 
sequences involved in C-CPE 
binding are shown in stick repre- 
sentation and labeled. Key C-CPE 
residues for claudin binding are 
colored yellow and labeled. (B and 
C) The extracellular domains of 
mCldnl9cryst and mCldnlS are 
shown as ribbon diagrams in solid 
colors and in transparent gray, 
respectively. Hydrophobic residues 
involved in linear claudin oligomer- 
ization are shown in stick repre- 
sentation and labeled. The ECH 
region, which is disordered in the 
mCldnl9cryst structure, is indicated 
by a dashed line. (D) Close-up view 
of the interaction of ECS2 with 
C-CPE. Main-chain carbonyl 
groups and side chains of ECS2 
residues involved in interactions 
with C-CPE are shown in stick 
representation. (E) Close-up view 
of the interaction of ECSl with 
C-CPE. Side chains in the loop 
region are shown in stick 
representation. Inter- and 
intramolecular hydrogen bonds 
and ionic bonds are shown by green, 
green, salmon, or gray, respectively. 



cyan, and red dashed lines, respectively. C-CPE residues that interact with ECSl, ECS2, or both are colored in light 
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claudin-19 with C-CPE was also seen in freeze- 
fracture electron microscopy images (fig. SI, B 
and C). We used FSEC to quantify the affinity of 
C-CPE for claudin-19, as well as for claudin-3 and 
claudin-1 as positive and negative controls, re- 
spectively. Our analysis yielded the apparent dis- 
sociation constant (Kq s) values for C-CPE binding 
of 240 + 18 nM and 7.9 ± 0.2 nM for claudin-19 
and -3, respectively, which is consistent with pre- 
vious reports (11, 19), and we found only neg- 
ligible binding for claudin-1 (fig. S2B). 

For crystallization, we removed 26 residues 
from the C terminus of mouse claudin-19 and 


substituted three membrane-proximal cysteines 
with alanines (fig. S3). After crystallization of 
this construct, mCldnl9crystj in complex with 
C-CPE, the structure was determined at 3.7 A 
resolution using the molecular replacement 
method (Fig. IB, fig. S4, and table SI). Crystals 
of mCldnl9cryst with bound C-CPE contain two 
1:1 claudin-toxin complexes per asymmetric unit 
(fig. S5), which, despite different crystal con- 
tacts, have almost identical conformations, 
with an average root mean square deviation 
(RMSD) between their Ca atoms of 0.39 A (fig. 
S5, B and C). 


The overall structure of mCldnl9cryst is simi- 
lar to that of mCldnl5 (14), a CPE-insensitive 
claudin, with the four TM segments forming a 
typical left-handed bundle (Fig. IB and fig. S6). 
In addition, the binding does not induce a 
conformational change in C-CPE (fig. S7). How- 
ever, the two extracellular (3-sheet segments of 
mCldnl9crysb ECSl and ECS2, are oriented dif- 
ferently compared with those of mCldnl5 (Fig. 2, 
A to C, and fig. S6). In mCldnlPayst, the loop 
between (31 and (32 (residues 35 to 42), which is 
disordered in the mCldnl5 structure (fig. S6), is 
clearly resolved and makes contact with C-CPE 



Fig. 3. The interface between mCldnl9cryst and C-CPE. (A) Surface representation of C-CPE with bound mCldnl9cryst shown in ribbon representation. C-CPE 
residues that interact with ECSl, ECS2, or both are labeled and colored in light green, salmon, or gray, respectively. (B and C) The surface of C-CPE that 
interacts with claudins viewed as in (A). (B) The C-CPE residues involved in the interaction with mCldnl9cryst are conserved and can thus mediate binding to all 
CPE-sensitive claudins. (C) Mapping of claudin-19 mutations that affect the binding affinity onto the surface of the interacting C-CPE residues. Violet indicates a 
significant reduction in binding affinity, and lime indicates a small reduction. 



Fig. 4. Structural insights into the disassembly of a TJ strand induced by C-CPE binding. The linear arrangement of claudin-19 in a TJ strand is 
modeled based on the mCldnlS crystal structure. The hydrophobic residues involved in the linear interactions are shown in stick representation and 
indicated by arrowheads (left panel). Binding of C-CPE disrupts the hydrophobic interactions between neighboring claudin protomers (arrowheads) 
and also creates steric clashes (double arrowhead) that prevent interactions with neighboring claudin protomers as well as with claudin protomers 
from an adjoining cell. 
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(Fig. 2, A and E). Also, the (35 strand in the loop 
connecting TM3 to TM4 is longer in mCldnl9ciyst 
than in niQdnl5 and can thus form more hydro- 
gen bonds with the adjacent (31 strand (Fig. 2B). In 
addition, whereas the disulfide bond between 
highly conserved cysteine residues at (33 and p4 
(C54 and C64, respectively) is also apparent in 
the mCldnl9cryst structure, the loop between (33 
and (34 seen in mCldnl5 is disordered (Fig. 2A 
and fig. S3). Finally, the residues in ECSl that 
connect (34 to TM2 (residues 69 to 74) are dis- 
ordered in mCldnl9ciysb whereas the correspond- 
ing region in mCldnl5 forms a short extracellular 
helix (ECH) that is oriented almost parallel to 
the membrane surface (Fig. 2C and fig. S4C). 

Experiments with ECS2-derived peptides sug- 
gested that the ECS2 region of Claudius con- 
tains a toxin-binding motif, NP(V/L)(V/L)(P/A), 
that is responsible for CPE binding {3, 9). This 
toxin-binding motif is probably located in a hy- 
drophobic cavity formed by three tyrosine resi- 
dues in C-CPE, ¥306*='’®, ¥310*^® and ¥312'='’® 
(throughout the text, CPE residues are indi- 
cated with superscripted CPE) (Fig. 2, A and D), 
thought to be a claudin-binding pocket (3, 9, 20). 
The ECS2 loop starting with a conserved aspar- 
agine residue, N150, stabilizes the complex with 
CPE by forming hydrogen bonds with the main 
chain carbonyl of P311^^^, the side chain of the 
positively charged R227^^^ residue, and so on 
(Fig. 2D and fig. S4B). Recently, a structure was 
determined of full-length CPE bound to a short 
peptide with a modified sequence of the claudin-2 
ECS2 (21). The ECS2 peptide was observed in the 
same hydrophobic cavity on C-CPE despite the 
differences in ECS2 sequences of claudin-2 and 
-19 (fig. S8). 

Because previous biochemical studies sug- 
gested that only ECS2 is involved in CPE binding 
(10, 19), the observed interaction of ECSl with 
C-CPE is unexpected. Although the sequences 
connecting the (31 and (32 strands in ECSl 
(residues 32 to 48) differ among claudin sub- 
types, the residues in the middle of this region 
(residues 39 to 42) are more conserved (fig. S3). 
The motif (A/N/S)-I-(I/L/V)-(T/V) in the claudin- 
19 structure clearly interacts with the surface of 
C-CPE, whereas the corresponding region is dis- 
ordered in the mCldnl5 structure (Fig. 2E and 
fig. S6). To test the binding site observed in the 
structure of the complex, we introduced point 
mutations in both ECSl and ECS2 of mCldnl9 in 
the region where they make contact with C-CPE 
(Fig. 3). The affinity of the mutants for C-CPE in 
detergent solution was assessed using FSEC (18) 
(table S2 and figs. S9 and SIO). The binding affin- 
ity for C-CPE was affected most by substitutions 
of residues in the putative binding motif in ECS2 
(N150-P154; Fig. 2, A and D) and of the three 
successive residues forming the conserved motif 
in ECSl (A39-I41; Fig. 2E). Mapping of these mu- 
tagenesis results onto the C-CPE surface (Fig. 3C) 
shows almost perfect agreement with the map- 
ping of the sequence conservation in CPE- 
sensitive claudins (Fig. 3B). Thus, both ECSl and 
ECS2 are involved in the physiological interac- 
tion with C-CPE. 


The surface of C-CPE that interacts with the 
extracellular claudin domains is mainly com- 
posed of hydrophobic residues (fig. Sll, A and B) 
but also contains residues that can form hydro- 
gen or ionic bonds, especially with ECS2 (Fig. 2 
and fig. SllC). The side chains of the residues 
forming the conserved motif in ECS2 (150N-P-S- 
T-P154 in mCldnl9) extend toward C-CPE and fit 
snugly into the hydrophobic cavity formed by the 
tyrosine triplet of C-CPE (Y306^^^, ¥310^^™ and 
Y 312 CPE) S4B). Mutation of these residues to 

alanine reduced the binding of CPE to claudin-4 
(22), and substitutions of S152 in ECS2 with hy- 
drophobic residues resulted in mCldnl9 variants 
with a higher affinity for C-CPE (table S2). Thus, 
the presence of a hydrophobic residue in ECS2 
that can fit into the hydrophobic cavity may 
determine whether a claudin is sensitive to CPE 
or not. The size of the side chain of N150 and the 
hydrogen bonds it could form were also critical 
for CPE binding, and a negative charge at this 
position inhibited CPE binding even if the pi of 
the ECS2 region remains unchanged (for details, 
see the materials and methods and table S2). 
These results are consistent with the finding that 
it is predominantly hydrophobic and uncharged 
residues that form the claudin-toxin interface 
around the CPE tyrosine triplet (indicated by the 
dashed line in fig. Sll). On the other hand, the 
residues of the conserved motif in ECSl (39A-I- 
141 in mCldnl9) among all claudins were sen- 
sitive to the size of their side chains (table S2). 
Therefore, it seems to be predominantly ECS2 that 
determines the sensitivity of a claudin for CPE, 
whereas ECSl contributes to the interaction sim- 
ply by enlarging the hydrophobic contact area. 

Structures of claudins with and without bound 
C-CPE could potentially provide insights into 
the disruption of TJ strands. The structure of 
mCldnl5, which does not bind CPE, enables us 
to build a homology model for mCldnl9 with- 
out bound C-CPE and thus to infer the confor- 
mational changes that may result from C-CPE 
binding (Fig. 4 and fig. S12). The claudin-CPE 
interaction would cause a disordering of the 
ECH region between (34 and TM2 (fig. S4C and 
S6B). Because the ECH could form hydropho- 
bic interactions with TM3 of the neighboring 
claudin protomer (14), it is conceivable that the 
movement of TM3 and the disordering of ECH 
induced by CPE binding destabilize the linear 
claudin polymer in the TJ strand. Furthermore, 
CPE binding would also interfere with the head- 
to-head interaction of claudin protomers in TJ 
strands in adjoining cells. Several studies support 
our insights (9, 23), but further experiments are 
needed to fully understand how CPE binding 
causes the disassembly of TJs. 

Our structure and mutational analyses of the 
interaction of C-CPE with claudin-19 may help in 
the design of drugs or biomarkers that selectively 
target specific claudin subtypes, regardless of 
their sensitivity to CPE (24, 2S). The molecular 
information on C-CPE-induced TJ disassembly 
may also be useful to design strategies aimed at 
increasing the permeability of drugs across TJs 
in the blood-brain barrier (26). 
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CHEMICAL BIOLOGY 

A small-molecule inhibitor of the 
aberrant transcription factor 
CBFp-SMMHC delays leukemia in mice 

Anuradha niendula/* * John A. Pvdikkan,^* Hon^iang Zong,^ Jolanta Grembecka,^ 

Liting Xue,^ Siddhartha Sen,^ Yunpeng Zhou/ Adam Boulton/ 

Aravinda Kuntimaddi/ Yan Gao/ Roger A. Rajewski/ Monica L. Guzman/ 

Lucio H. CastiUa/f John H. BushweUer^f 

Acute myeloid leukemia (AML) is the most common form of adult leukemia. The 
transcription factor fusion CBFp-SMMHC (core binding factor p and the smooth-muscle 
myosin heavy chain), expressed in AML with the chromosome inversion inv(16)(pl3q22), 
outcompetes wild-type CBFp for binding to the transcription factor RUNXl, deregulates 
RUNXl activity in hematopoiesis, and induces AML. Current inv(16) AML treatment with 
nonselective cytotoxic chemotherapy results in a good initial response but limited 
long-term survival. Here, we report the development of a protein-protein interaction 
inhibitor, AI-10-49, that selectively binds to CBFp-SMMHC and disrupts its binding to 
RUNXl. AI-10-49 restores RUNXl transcriptional activity, displays favorable pharmacokinetics, 
and delays leukemia progression in mice. Treatment of primary inv(16) AML patient blasts 
with AI-10-49 triggers selective cell death. These data suggest that direct inhibition of the 
oncogenic CBFp-SMMHC fusion protein may be an effective therapeutic approach for inv(16) 
AML, and they provide support for transcription factor targeted therapy in other cancers. 


A cute myeloid leukemia (AML) is the most 
common form of adult leukemia (1). Long- 
term survival for AML remains poor and 
varies ^vith the mutational composition of 
the leukemic cells. The transcription factor 
fusion CBFp-SMMHC (fusion of core binding fac- 
tor p and smooth-muscle myosin heavy chain), 
expressed in AML with the chromosome inver- 
sion inv(16)(pl3q22), cooperates with activating 
mutations in components of cytokine signaling 
pathways in leukemia transformation (2-5). CBFp 
is a component of the heterodimeric transcrip- 
tion factor core binding factor, where it binds 
to RUNX proteins and enhances their affinity 
for DNA (6), and the resulting complex plays a 
key role in regulating hematopoiesis (7). CBFp- 
SMMHC outcompetes CBFp for binding to RUNXl 
(8), deregulates RUNXl transcription factor activ- 
ity in hematopoiesis, and induces AML. Current 
inv(16) AML treatment with nonselective cyto- 
toxic chemotherapy results in a good initial re- 
sponse but limited long-term survival. Studies in 
mice and patient samples support the concept 
that inv(16) is a driver mutation that generates 
preleukemic progenitor cells that, upon acquisi- 
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tion of additional cooperating mutations, pro- 
gress to leukemia (3, 4, 9-12). 

To develop a taigeted inhibitor of CBFp-SMMHC 
function, we used a previously described fluores- 
cence resonance energy transfer (FRET) assay 
(13) with Venus-CBFp-SMMHC replacing Venus- 
CBFp (fig. SI) to screen the National Cancer In- 
stitute, NIH, Diversity Set for compounds that 
inhibit the binding of CBFp-SMMHC to the RUNXl 
Runt domain. This screen identified the active 
compound AI-4-57 with a 50% inhibitory con- 
centration (IC50) of 22 pM, whereas AI-4-88, a 
derivative lacking the methoxy functionality, is 
inactive (Table 1). Changes in the chemical shifts 
in a nuclear magnetic resonance (NMR) spectrum 
of a protein upon binding of a small molecule 
are a powerful method to confirm binding to a 
protein. We recorded two-dimensional 2D ^^-^H 
heteronuclear single quantum coherence (HSQC) 
spectra and ID saturation transfer difference 
(STD) NMR experiments of AI-4-57 with CBFp 
and the Runt domain. No interaction was ob- 
served for the Runt domain, but we can demon- 
strate chemical shift perturbations in the HSQC 
spectrum of CBFp upon addition of AI-4-57 (Fig. lA) 
and no changes upon addition of the inactive 
derivative AI-4-88 (fig. S2), which establishes that 
the compound binds to CBFp. Chemical shift per- 
turbations in the backbone and in two aromatic 
side chains [tryptophan at position 113 (W113) and 
tyrosine at position 96 (Y96)] indicate that the 
compound binds in a site spatially close to CBFp 
but not on the protein-protein interaction surface 
on CBFp, that is, it acts in an allosteric manner to 
inhibit binding (fig. S3). 

We have shown that a reduced dosage of CBFp 
in the presence of a CBFp-SMMHC knockin al- 


lele enhances leukemogenesis in mice (14) and 
argue that selectivity for CBFp-SMMHC versus 
CBFp is critical for in vivo utility. To achieve such 
specificity, we have taken advantage of the state 
of the two in solution: CBFp-SMMHC is oligo- 
meric, whereas CBFp is monomeric (8, IS) and 
have applied the principles of polyvalency (16, 17) 
to develop derivatives of AI-4-57 with enhanced 
potency and selectivity (Fig. IB). Substitutions at 
the five position of the pyridine ring do not affect 
activity (Table 1), so we have utilized polyethylene 
glycol-based linkers at this position to create 
bivalent derivatives with 5-, 7-, 10-, and 16-atom 
linker lengths (Table 1). Measurement of the IC50 
values v^th the FRET assay (Fig. 1C) shows that 
the five-atom linker compound has less activity, 
but the longer linker compounds show potent 
inhibition with the seven-atom linker inhibitor 
(AI-4-83) yielding a 350 nM IC50, which corre- 
sponds to a 63-fold enhancement over the mono- 
valent compound (Fig. ID and Table 1). In addition, 
AI-4-83 achieves >10-fold dissociation of CBFp- 
SMMHC and RUNXl Runt domain at saturating 
concentrations (Fig. ID). 

The activity of the bivalent inhibitors on cell 
growth was tested in three human leukemia 
cell lines [ME-1, inv(16) cell line; Kasumi-1, t(8;21) 
cell line; and U937, lymphoma cell line] by using a 
3-(4,5-dimethylthiazol-2-yi)-2,5-diphenyltetrazolium 
bromide (MTT) conversion assay. Mimicking the 
results obtained with the FRET assay, even in 
terms of relative efficacy, growth of inv(16) cell 
line ME-1 was sensitive to compounds AI-4-71, 
AI-4-83, and AI-4-82 but not to AI-10-19 (Fig. IE). 
In contrast, growth of non-inv(16) cell lines U937 
and Kasumi-1 was unaffected over the same con- 
centration range (fig. S4, A and B), which demon- 
strated a high degree of specificity and suggested 
that the activity of these bivalent compounds 
was on target. 

Analysis of the pharmacokinetic properties of 
AI-4-57 showed that the compound has a short 
half-life (ty 2 = 37 min) in mouse plasma (fig. S5) 
and that loss of the methyl group from the 
methoxy functionality is the primary metabolite. 
Trifluoromethoxy (CF3O) substitutions have been 
shovm to be less reactive (18, 19), so we synthe- 
sized AI-10-47 v^th this substitution. FRET mea- 
surements show that this substitution actually 
enhances the activity of the monovalent com- 
pound (Table 1). Measurements of stability in 
liver microsomes showed that AI-10-47 reduced 
the metabolic liability and so justified the syn- 
thesis of the bivalent derivative AI-10-49 (Table 
1). AI-10-49 is potent (FRET IC50 = 260 nM) (Table 1) 
[isothermal titration calorimetry (ITC) measure- 
ments yielded a dissociation constant (Xd) = 
168 nM] (fig. S6), has improved in vivo pharma- 
cokinetic properties (ty 2 = 380 min) (fig. S5), and 
has enhanced inhibitory activity on ME-1 cell 
growth (IC50 = 0.6 pM) (Fig. IF) compared ^vith 
the parent protonated bivalent compound AI-4-83 
(IC50 of ~3 pM) (Fig. IE). Note that AI-10-49 showed 
negligible activity (IC50 > 25 pM) in normal human 
bone marrow cells (Fig. IG), which indicated a 
robust potential therapeutic v^ndow. In a panel 
of 11 human leukemia cell lines, ME-1 cells were 
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the only cell line highly sensitive to AI-10-49 
(fig. S7). 

The specificity of AI-10-49 in disrupting en- 
dogenous RUNXl binding to CBFp-SMMHC ver- 
sus CBFp was assessed in ME-1 cells. AI-10-49 
effectively dissociated RUNXl from CBFp-SMMHC, 
with 90% dissociation after 6 hours of treatment, 
whereas it had only a modest effect on CBFp- 
RUNXl association (Fig. 2A). The stability of 
RUNXl, CBFP, and CBFP-SMMHC was not affected 
by AI-10-49 (fig. S8A). Expression of the RUNXl- 
regulated genes RUNX3, CSFIR, and CEBPA is 
repressed by CBFp-SMMHC in inv(16) AML {20-22). 
Previous studies have shown decreased RUNXl 
binding to target genes in the presence of CBFp- 
SMMHC {23, 24), vdiich suggests that CBFp-SMMHC 
represses RUNXl taiget genes by blocking binding 
of RUNXl to target DNA sites (Fig. 2B). Consistent 
with this model, chromatin-immunoprecipitation 
(Chip) assays showed that treatment of ME-1 cells 
for 6 hours with AI-10^9 increased RUNXl oc- 
cupancy 8-, 2.2-, and 8-fold at the RUNX3, CSFIR, 
and CEBPA promoters, respectively, whereas no 
enrichment was observed at control loci (Fig. 2C 
and fig. S8, B and C). In accordance with this, 
treatment of ME-1 cells for 6 or 12 hours with AI- 
10^9 increased expression of RUNX3, CSFIR, and 
CEBPA but had no effect on control gene PINl (Fig. 


2D). Neither of these effects was observed in inv(16)- 
negative U937 cells. These data establish AI-10-49 
selectivity in inhibiting CBFp-SMMHC binding 
to RUNXl and validate our approach of using 
bivalent inhibitors to achieve this specificity. 

Up to 90% of inv(16) AML patients have 
cooperating mutations in components of the 
receptor tyrosine kinase pathway, including 
N-RAS and c-Kit (25). We have recently devel- 
oped an efficient mouse model of inv(16) AML, 
by combining the conditional and 

Cbft/^™^^ alleles {26). To test the effects of AI-10- 
49 administration in vivo, we transplanted mice 
with leukemic ceHs, waited 

5 days for engraftment, and then treated mice 
with vehicle [dimethyl sulfoxide (DMSO)] or 
200 mg/kg of body weight AI-10-49 for 10 days, 
and assessed the effect on disease latency. As 
shown in Fig. 3A, vehicle-treated mice succumbed 
to leukemia with a median latency of 33.5 days, 
whereas AI-10-49 -treated mice survived signif- 
icantly longer (median latency = 61 days; P = 
2.7 X 10"^). Thus, transient treatment with AI- 
10-49 reduces leukemia expansion in vivo. Al- 
though we have not assessed toxicity after long- 
term exposure, after 7 days of administration of 
AI-10-49, we observe no evidence of toxicity 
(figs. S9 to Sll). 


To test the potential utility of AI-10^9 for use 
in human inv(16) leukemia treatment, we eval- 
uated the survival of four primary inv(16) AML 
cell samples treated for 48 hours with a dose 
range of monovalent AI-10-47 and bivalent AI-10- 
49. As shown in Fig. 3B, the viability of inv(16) 
patient cells was reduced by treatment with AI- 
10-49 at 5 and 10 pM concentrations (individual 
dose-response experiments are shown in fig. S12). 
Note that the bivalent AI-10-49 was more potent 
than the monovalent compound AI-10-47 and so 
recapitulated the effects observed in the human 
inv(16) cell line ME-1. In contrast, the viability of 
normal karyotype AML samples was not affected 
by AI-10-49 treatment (Fig. 3C). Analysis of an 
additional set of five AML samples revealed that 
AI-10-49 treatment specifically reduces the via- 
bility of inv(16) leukemic cells without having an 
apparent effect on their differentiation (fig. S13). 
AJ-10-49 specificity was also evident when we 
assessed the ability of AML cells to form colonies 
by evaluating colony-forming units (CFUs) after 
compound exposure. The ability of inv(16) AML 
cells to form CFUs was selectively reduced by 
AI-10-49 when compared with normal karyotype 
and t(8;21) AML patient samples (Fig. 3D). This 
inhibitory effect was dose-dependent (40 and 
60% at 5 and 10 pM, respectively) (Fig. 3E), 


Table 1. Chemical structures and IC 50 values of AI-10-49 and related compounds, determined by using the FRET assay. 


Compound Name 

Compound Structure 

FRET ICso (mM) 

AI-4-88 


>240 

AI-4-57 

U-tj' 

22 ±8 

AI-10-11 

H 

14±4 

AI-10-19 

HsCO^^^N /=x /=\ 

H H 

>2.5 

AI-4-83 

H H 

0.35 ± 0.05 

AI-4-82 

n /=\ n^^-wOCH3 

"“AMU? 

H 

0.44 ± 0.07 

AI-4-71 

H 

0.37 ± 0.07 

AI-10-47 


2.0 ±0.3 

AI-10-49 

H H 

0.26 ±0.01 
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Fig. 1. Development of potent selective inhibitor of CBFp-SMMHC-RUNX 
binding. (A) HSQC spectrum (peaks correspond to all NH moieties of 
protein) of CBFp alone (blue) and CBFp + AI-4-57 (red). (B) Schematic dia- 
gram for the application of polyvalency to develop a specific and potent in- 
hibitor of CBFp-SMMHC-RUNX binding. Equations refer to predicted Kq 
values for a bivalent inhibitor binding to CBFp and CBFp-SMMHC. Ceff is the 
effective local concentration, which depends on the distance between CBFp 
domains in the oligomeric CBFp-SMMHC. (C) FRET assay measurements 
for bivalent inhibitors with varying linker lengths with 10 nM Cerulean-Runt 
domain and 10 nM Venus-CBFp-SMMHC. They axis is the ratio of emission 
intensities at 525 and 474 nm. Three independent measurements were 
performed, and their average and standard deviation were used for IC 50 
data fitting. (D) FRET assay measurements of inhibition of CBFp-SMMHC- 
RUNX binding for AI-4-57 and AI-4-83 with 10 nM Cerulean-Runt domain 
and 10 nM Venus-CBFp-SMMHC. Data for AI-4-83 are the same as pre- 
sented in (C). Data for these two compounds are presented separately for 


clarity of comparison to one another. Left y axis is the ratio of emission in- 
tensities at 525 and 474 nm. Right y axis indicates the FRET ratios observed 
with addition of 100 nM and 1000 nM untagged CBFp, corresponding to 
roughly 1-fold and 10-fold dissociation of CBFp-SMMHC and Runt domain 
[CBFp-SMMHC binds with 7-fold the affinity of CBFp ( 8 )]. Three independent 
measurements were performed, and their average and standard deviation 
were used for IC 50 data fitting. (E) Dose-dependent effect of a 24-hour treat- 
ment of ME-1 cells with bivalent inhibitors with varying linker lengths measured 
by MTT assay and normalized to the DMSO-treated group. Each symbol 
represents the mean of triplicate experiments: error bars represent the SD. 
(F and G) Dose-dependent effect of AI-10-47 (red) and AI-10-49 (black) 
treatment for 48 hours: (F) ME-1 cells were assessed by annexin V and 7- 
amino-actinomycin (7AAD) viability staining, and (G) human bone marrow 
cells were assessed by MTT assay. The data were normalized to the DMSO- 
treated group. Each data point represents the mean of triplicate experiments: 
error bars represent the SD. 
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whereas there was no change in CPUs of AML 
cells treated with AI-10-47, AML cells with nor- 
mal karyotype (Fig. 3F), or CD34+ cord blood 
cells (Fig. 3G). These studies show that AI-10-49 
selectively inhibits viability and CFU capacity 
in inv(16) AML blasts, whereas it has negligible 
effects on AML blasts with normal karyotype or, 
importantly, on normal human hematopoietic 
progenitors. 

Dysregulated gene expression is a hallmark 
of cancer and is particularly important for the 
maintenance of cancer stem-cell properties, such 
as self-renewal, which lead to relapse. As such, 
the targeting of proteins that drive transcrip- 
tional dysregulation, so-called ‘transcription ther- 
apy,” represents an avenue for drug development 
with immense potential. A number of fusion pro- 


teins involving transcription factors have been 
identified as drivers of disease in leukemia {27)', 
sarcoma {28)', and, recently, in prostate cancer 
{29), which provide excellent targets for therapeut- 
ic intervention. Our results provide a proof-of- 
principle for this approach, as AI-10-49 specifically 
inhibits CBF(3-SMMHC-RUNX binding and shows 
efficacy against CBFp-SMMHC-driven leukemia 
in mice with no obvious side effects. Specificity 
of action is a key component in the development 
of a targeted drug. Imatinib, for example, shows 
excellent specificity, and its efficacy in chronic 
myelogenous leukemia is clearly a result of effec- 
tive inhibition of the BCR-ABL fusion protein that 
drives chronic myelogenous leukemia. However, 
even this highly selective agent inhibits both the 
BCR-ABL fusion protein, as well as wild-type ABL. 


The “Holy Grail,” as it were, of targeted therapy 
with fusion or mutated protein drivers of cancer 
is to achieve inhibition of the fusion or mutated 
protein with little to no effect on the wild-t3^e 
protein. This study demonstrates that AI-10^9 
represents an example of such selectivity for 
inv(16) leukemia, as it inhibits CBFp-SMMHC activ- 
ity while having a minimal effect on CBFp func- 
tion. There are relatively few examples of drugs 
targeting transcription factors. ATRA {dJi[-trans- 
retinoic acid) for RAR (retinoic acid receptor) fu- 
sions in leukemia and MDM2-p53 inhibitors are 
successful examples; however, neither of these has 
the selectivity of AI-10-49. In addition, AI-10-49 
has the key properties of a high-quality chemical 
probe recently outlined by Frye (39)— namely, a 
clear molecular profile of activity, mechanism of 
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Fig. 2. Specificity of AI-10-49 activity on CBFp-SMMHC-RUNXl bind- 
ing. (A) Effect of 1 |iM AI-10-49 on CBFp-RUNXl and CBFp-SMMHC-RUNXl 
binding at 3 and 6 hours in ME-1 cells, measured by coimmunoprecipi- 
tation (quantification of three experiments is shown on the right). (B) 
Schematic of the effect of CBFp-SMMHC on RUNXl occupancy and tar- 
get gene expression and the effect of AI-10-49 on occupancy and expres- 
sion. (C) Chromatin immunoprecipitation assay showing RUNXl occupancy 
on RUNX3, CSFIR, and CEBPA in ME-1 and U937 cells treated with 1 pM AI- 


10-49 for 6 hours and represented as fold enrichment relative to DMSO- 
treated cells. Each symbol represents the mean of triplicate experiments: 
error bars represent the SD. (D) Relative expression (qRT-PCR) of RUNX3, 
CSFIR, and CEBPA in ME-1 and U937 cells treated with 1 pM AI-10-49 for 6 
and 12 hours, and normalized to the DMSO control group. Each symbol 
represents the mean of triplicate experiments: error bars represent the SD. 
For all panels, significance was calculated as unpaired t-test, *P < 0.05, or 
< 0 . 001 . 
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Fig. 3. Activity of AI-10-49 in inv(16) mouse mod- 
el and inv(16) AML patient samples. (A) Kaplan- 
Meier survival curve of mice (n = 11 per group) 
transplanted with 2 x 10^ 
leukemic cells and treated between days 5 and 15 
posttransplantation (blue arrows) with DMSO (black 

line) or 200 mg/kg of body weight per day AI-10-49 (red line): Statistics described in the statistical methods section. (B) Percent viability (annexin V and 7AAD 
assay) relative to vehicle control (DMSO) for CD34-r purified primary human inv(16) AML samples treated for 48 hours with either AI-10-49 or AI-10-47 at the 
indicated concentrations. Each symbol represents the average for an individual sample from duplicate treatments. The line represents the mean; error bars 
represent the SD. (C) Percent viability (annexin V and 7AAD assay) relative to vehicle control (DMSO) for primary human AML samples with normal karyotype 
treated for 48 hours with either AI-10-49 or AI-10-47 at the indicated concentrations. Each symbol represents the average for an individual sample from duplicate 
treatments. The line represents the mean of all biological replicates: error bars represent the SD. (D) Percentage of colony-forming units (OFUs) after treatment 
with AI-10-49 relative to vehicle control (DMSO) for primary human AML cells. Each symbol represents the average for an individual sample from duplicate 
treatments: error bars represent the SD. (E) Percent OFUs to vehicle control (DMSO) for 0D34-i- purified primary human inv(16) AML samples treated with either 
AI-10-49 or AI-10-47 at the indicated concentrations. OFU assays were performed in triplicate. Error bars represent the SD. (F) Percent OFUs to vehicle control 
(DMSO) for CD34-<- purified primary human AML samples with normal karyotype treated with either AI-10-49 or AI-10-47 at the indicated concentrations. OFU 
assays were performed in triplicate. Error bars represent the SD. (G) Percent OFUs to vehicle control (DMSO) for 0D34-i- purified primary 0D34-t- cord blood cells 
treated with either AI-10-49 or AI-10-47 at the indicated concentrations. Significance calculated as unpaired f test, *P < 0.05 or ***P < 0.001. 


action, identity of active species, and proven utility. 
Development of agents, like AI-10-49, which can 
inhibit the driver mutation(s) in specific types 
of cancer, is essential for better therapeutic out- 
comes for patients. 

In summary, AI-10-49 is a potent and specific 
first-generation CBFp-SMMHC lead compound 
that induces cell death in inv(16) leukemic cells. 
The work described here provides additional 


evidence that transcription factor drivers of can- 
cer can be directly targeted. 
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HUMORAL IMMUNITY 

Apoptosis and antigen affinity limit 
effector cell differentiation of a 
single naive B cell 

Justin J. Taylor/’^* * Kathryn A. Pape,^ Holly R. Steach,^ Marc K. Jenkins^ 

When exposed to antigens, naive B cells differentiate into different types of effector cells: 
antibody-producing plasma cells, germinal center cells, or memory cells. Whether 
an individual naive B cell can produce all of these different cell fates remains unclear. Using 
a limiting dilution approach, we found that many individual naive B cells produced only 
one type of effector cell subset, whereas others produced all subsets. The capacity to 
differentiate into multiple subsets was a characteristic of clonal populations that divided 
many times and resisted apoptosis, but was independent of isotype switching. Antigen 
receptor affinity also influenced effector cell differentiation. These findings suggest 
that diverse effector cell types arise in the primary immune response as a result 
of heterogeneity in responses by individual naive B cells. 


A ntibody production results from a differen- 
tiation process that begins when the surface 
form of immunoglobulin (Ig) known as the 
B cell receptor (BCR) on a naive B cell binds 
an antigen (1, 2). BCR signaling causes the 
B cell to migrate to the border of the T cell zone, 
where it receives signals from T cells (3, 4). These 
signals cause the B cell to proliferate and differen- 
tiate into several types of effector cells, including 
short-lived plasma cells, germinal center (GC) cells, 
and GC-independent memory cells (7, 2). GC cells 
then undergo somatic hypermutation in their Ig 
genes, and cells with mutations that improve 
BCR affinity for antigen are selected to become 
GC-dependent memory or plasma cells (i, 2). 

Despite the importance of this process to immu- 
nity and vaccination, it is unclear how individual 
naive B cells simultaneously produce all of the early 
effector cell types. Some studies suggest that dif- 
ferent naive B cell clones only produce a single ef- 
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fector subset, depending on BCR affinity for antigen 
(5-5) or intrinsic stochastic biases of the respond- 
ing clonal population {9). Alternatively, each naive 
B cell may produce all effector cell types, as sug- 
gested by recent work on naive T cells {10-13). 

We addressed these possibilities by tracking 
the fates of antigen-specific naive B cells during 
the primary immune response to the protein an- 
tigen allophycocyanin (APC). Using a sensitive 
antigen-based cell enrichment method {14), we 
found that the spleen and lymph nodes of a 
C57BL/6 (B6) mouse contained about 4000 poly- 
clonal APC-specific naive B cells, which produced 
-100,000 effector cells 7 days after immunization 
with APC in complete Freund’s adjuvant (CFA) 
(Fig. 1, A and B). As expected, the effector cell 
population consisted of B220^‘^'^ antibody- 
secreting plasma cells, CD38“ GLT^ GC cells, CD38^ 
GL7" memory cells, and a few remaining undif- 
ferentiated CD38^ GL';^ activated precursors (APs) 
(75) (Fig. 1, C and D, and fig. SI). 

In vivo limiting dilution was used to assess the 
multipotentiality of a single APC-specific naive B 
cell. Before limiting dilution could be achieved, it 
was necessary to determine the fraction of APC- 
specific naive B cells that responded to immuni- 


zation. Twenty million B cells from CD45.1^ mice 
that were never exposed to APC were labeled 
with the cell division-tracking dye carboxyfiuo- 
rescein succinimidyl ester (CFSE) {16) and trans- 
ferred into CD45.2^ recipients. Donor-derived 
APC-specific B cells were CFSE^^^ 7 days after 
immunization with CFA alone, which is indica- 
tive of cells that had not divided (Fig. IE). After 
the injection of APC in CFA, most donor APC- 
specific B cells were CFSE'°"', and the CFSE*'*®'' 
population was 33% smaller than in mice injected 
with CFA alone (Fig. 1, E and F). These results 
indicated that one in three APC-specific naive B 
cells, or 1 in 60,000 total B cells, proliferated in 
mice immunized with APC. The 33% response 
frequency of APC-specific naive B cells was not a 
limitation of the CFSE dilution assay, because 97 
to 100% of naive MD4 B cells proliferated (fig. S2) 
after the injection of hen egg lysozyme (HEL) or 
duck egg lysozyme (DEL), for which the MD4 
BCR has a high or medium affinity, respectively 
(77). Thus, the 33% responder frequency was a 
feature of the polyclonal APC-specific B cell pop- 
ulation under these immunization conditions. 

Limiting dilution experiments were then per- 
formed, based on the above knowledge and the 
fact that 7.7 ± 2.8% {n = 116 recipients) of donor 
naive B cells survive after transfer. 2 x 10^ or 0.2 x 
10^ CD45.1^ B cells were transferred into CD45.2^ 
mice, with the expectation that an average of 
3.3 or 0.33 APC-responsive CD45.1^ naive B cells 
would survive per recipient. Seven days after 
APC immunization, mice that did not receive 
transferred B cells contained two or fewer CD45.U 
background events (Fig. 2A). All mice that received 
2 X 10^ B cells contained a defined population of 
CD45.1^ donor-derived APC-specific B cells that 
had proliferated in response to APC (Fig. 2, A and 
B). In contrast, 19% (74 out of 384) of mice that 
received the limiting number of 0.2 x lO^ B cells 
contained donor-derived APC-responsive B cells 
(Fig. 2, B and C). Based on the Poisson distribu- 
tion (75), over 91% of the donor-derived popula- 
tions in this group were the progeny of a single 
naive B cell. 

Extensive effector cell heterogeneity was ob- 
served in the progeny of individual naive B cells. 
Single naive B cells produced between 4 and 957 
progeny, with a median of 16 (Fig. 2C). The poly- 
clonal naive cell populations of recipient origin 
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produced all effector cell subsets, but 35 of the 74 
clonal populations (44%) contained only plasma 
cells, only GC cells, only GC-independent memory 


cells, or only AP cells (Fig. 2, D and E). Four clonal 
populations contained all four subsets, and many 
contained two or three. 


Large clonal populations were more likely to 
contain multiple subsets. The clonal populations 
that contained all four subsets had a median of 





cells (PC) 


CFA alone 


APC/CFA 




Fig. 1. Assessing the polyclonal APC-specific 
B cell response. (A) Detection and (B) quantita- 
tion of APC-specific B cells from pooled spleen and 
lymph node samples enriched using anti-APC micro- 
beads from naive mice (n = 18) or mice immunized 
with APC in CFA (n = 45) 7 days earlier. (C) Detection 
and (D) quantitation of APC-specific plasma cells 
(Ig^'®^), GC (Ig^ B220^ CD38" GL7^), naive/memory 
(M) (Ig^ B220^ CD38^ GIT), and AP B cells (Ig^ B220^ 
CD38^ GL7^). Memory B cells were quantitated by 
the increase in CD38^ GLT^ cells over uninjected con- 
trols. (E) Detection and (F) quantitation of CFSE^'^^ 
CD45.r APC-specific B cells from CD45.2^ mice that 
received 2 x lO^ donor CD45.r B cells before im- 
munization with APC in CFA, or CFA alone (n = 10). 
Numbers on the flow cytometry plots in (A), (C), and 
(E) reflect the percent of cells within the gated 
population. These percentages and knowledge of 
the total number of B cells in the enriched fraction 
were used to calculate the number of cells shown in 
(B), (D), and (F).The bars represent the mean and P 
values determined using an unpaired two-tailed 
Student’s t test. Data points were combined from 
two to six experiments. 
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Fig. 2. Assessing the response of an individual naive 
APC-specific B cell. (A) Detection of APC-specific donor 
cells from CD45.2^ recipients that received 0.2 or 2 x 10® 
CFSE or Celltrace violet (CTV)-labeled CD45.r B cells 
1 to 3 days before immunization with APC in CFA. Sam- 
ples were analyzed 7 days after immunization, after simul- 
taneous CD45.1 and APC-based cell enrichment. The fourth 
and fifth plots show CFSE profiles for gated populations a 
and b from the second and third plots. Numbers on the 
plots reflect the percent of cells within the gated population. 

(B) Frequency of immunized recipient mice (n = 6 for mice 
that received 2 x 10® cells or n = 384 for those that received 
0.2 X 10®) containing an APC-specific CFSE/CTV'°'" donor 
population above the limit of detection (LCD) of two cells. 

(C) Total number of cells in APC-specific clonal populations 
from 74 mice that received 0.2 x 10® cells and contained a 
population above the LCD. (D) Frequency of each subset 
within polyclonal or clonal APC-specific populations. Subsets 
are gated as shown in Fig. 1C, and each row depicts an indi- 
vidual clone (n = 74) or the entire APC-specific population 
from a mouse (n = 45). (E) Frequency of APC-specific clones 
generating one, two, three, or four subsets. (F) Total num- 
ber of cells produced by each clone, separated into groups 
based on the number of subsets produced. A Mann- 
Whitney test was used to generate the P values. Bars in 
(C) and (F) represent medians. Data points were com- 
bined from 12 experiments. 
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352 cells, whereas the populations that contained 
only one had a median of 10 cells (Fig. 2F). No 
relationship was observed between the size of a 
clonal population and the frequency of cells in 
it that divided seven or more times (Fig. 3, A 
and B). This finding pointed toward cell death 


as a basis for the differences in clonal popula- 
tion size. Consistent with the idea that cell death 
limits population size, most clonal populations 
contained less than 20% of the minimum num- 
ber of daughter cells expected based on their 
CFSE profile (Fig. 3C). This effect was not uni- 


form across all populations. Large clonal pop- 
ulations that contained all four subsets exceeded 
the expected minimum number of daughter 
cells, whereas the small populations that con- 
tained only one subset had a median of only 12% 
(Fig. 3D). 


Fig. 3. Assessing proliferation and apoptosis of APC- 
specific clones. (A) Frequency of cells in each CFSE/CTV 
division bin (Div) in wild-type (n = 74) or Bim-deficient 
(n = 34) APC-specific clonal populations. Clones are dis- 
played in the same order as in Fig. 2D. (B) Total number 
of cells detected for each wild-type (open circle) or Bim- 
deficient (black triangle) clone compared to the frequency 
of cells completing at least seven divisions. (C and D) 
Number of cells detected for each clone displayed as a 
percentage of the minimum number predicted based on 
CFSE/CTV dilution analysis, with the clones grouped in 
(D) based on the number of subsets produced. (E) Total 
number of cells produced by APC-specific Bim-deficient 
clones. (F and G) Frequency of clones that produced (F) 
the indicated number or types of subsets or (G) any of the 
indicated subset. The bars in (C) to (E) represent medians. 
P values were determined in (C) and (D) using a Mann- 
Whitney test and in (F) and (G) using Fisher’s exact test. 
Data points were combined from 3 to 12 experiments. 
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Fig. 4. Assessing the response of an individual 
BCR transgenic B cell. (A) Representative detec- 
tion of CD45.2'" IgM*' CFSE'"” donor cells from 
recipients of a limited number (5 to 15 cells) of 
MD4 Ragr^~ B cells 1 to 3 days before immuni- 
zation with HEL-OVA or DEL-OVA in GFA. Sam- 
ples were analyzed 7 days after immunization after 
GD45.2-based cell enrichment. The second and third 
plots show gated populations a and b from the first 
and second plots. Numbers on the plots reflect the 
percent of cells within the gated population. (B) 
Total number of cells in HEL-stimulated MD4 (black 
circles, n = 31), MD4 DEL-stimulated (gray circles, 
n = 40), or AID-deficient APG-specific (white cir- 
cles, n = 27) clonal populations generated by im- 
munization. (C and D) Frequency of clones that 
produced (G) the indicated subset combinations 
or (D) or any of the indicated subset. (E) Amount 
of APG staining of memory cells in clonal APG- 
specific populations that produced memory cells 
and plasma cells or memory cells but not plasma 
cells. The bars in (B) and (E) represent medians. 
P values were determined in (B) and (E) using a 
Mann-Whitney test and in (G) and (D) using Fisher’s 
exact test. Data points were combined from 17 
MD4 experiments, 3 AID-deficient experiments, 
and 12 wild-type experiments. 
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These results suggested that the multipotential- 
ity of a single naive B cell was related to the pro- 
duction of progeny resistant to cell death. This idea 
was tested using B cells lacking the proapoptotic- 
mediator Bim (19). Bim-deficient and wild-type B 
cells fluxed calcium and proliferated equally in 
response to BCR signaling in vitro (fig. S3). Single 
Bim-deficient, APC-specific, naive B cells, however, 
produced 3.4-fold more progeny than wild-type 
clones (P < 0.0001, compare Fig. 2C to Fig. 3E) in 
response to APC immunization. Bim-deficient 
clones were also more likely to produce multiple 
effector cell subsets (Fig. 3F), especially those 
containing plasma cells, GC cells, and AP cells 
(Fig. 3G). Unlike wild-type clones, Bim-deficient 
clonal populations showed a significant correla- 
tion between the number of cells and cell division 
(Fig. 3B), suggesting that they experienced less 
apoptosis. Consistent with apoptosis limiting pop- 
ulation size, most Bim-deficient clones approached 
or exceeded the minimum number of daughter cells 
expected based on their CFSE profile (Fig. 3C). Thus, 
the capacity of a single naive B cell to produce many 
effector cells and multiple subsets appears to be lim- 
ited by Bim-mediated apoptosis, although suppres- 
sion of proliferation by Bim could also contribute. 

The multipotentiality of a single naive B cell 
could also be influenced by BCR affinity for anti- 
gen. This was tested by comparing the response 
of single BCR transgenic MD4 B cells to high- 
affinity (HEL) or medium-affinity (DEL) antigens 
to that of activation-induced cytidine deaminase 
(AID)-deficient APC-specific B cells (Fig. 4, A and 
B), which like MD4 cells are unable to undergo 
class switching (20). Single AID-deficient APC- 
specific B cells produced a similar number (com- 
pare Fig. 2C to Fig. 4B) and diversity (Fig. 4C) of 
effector cells as their wild-type counterparts, in- 
dicating that class switching does not play a ma- 
jor role in differentiation at the early time point 
analyzed in these experiments. In contrast, single 
naive MD4 cells stimulated with HEL or DEL 
produced more progeny than single APC-specific 
cells stimulated with APC (Fig. 4, A and B). 58% 
of HEL-stimulated single MD4 cells produced 
only plasma cells and GC cells, a combination 


that occurred in few clonal APC-specific B cell 
populations (Fig. 4C). This idiosyncrasy was re- 
lated to BCR affinity, because only 12.5% of single 
MD4 cells produced this pattern when stimu- 
lated with the lower-affinity antigen DEL (Fig. 4C). 
In addition, HEL-stimulated clones generated 
more plasma cells and fewer memory cells than 
DEL-stimulated clones (Fig. 4D), which is consist- 
ent with earlier work in the MD4 system (5, 8). 
The tendency of clones with higher-affinity BCRs 
to produce plasma cells was also observed in the 
polyclonal repertoire. BCR affinity for antigen 
was indirectly measured as the amount of APC 
bound to memory cells, which express BCR at 
levels similar to naive cells (fig. SIA). Among the 
APC-specific clones that produced memory cells, 
those that also produced plasma cells bound 
more APC than those that did not (Fig. 4E). 
Together, these data indicate that BCR affinity 
for antigen influences the precise effector cell 
pattern produced by a naive B cell. 

Overall, our results demonstrate that individ- 
ual naive B cells vary greatly with respect to the 
number and types of effector cells generated ear- 
ly during the primary response, as reported for 
naive T cells (10-13). Analogous to CD4^ T cells 
(12), the precise effector cell subset pattern pro- 
duced by a single naive B cell was influenced by 
BCR affinity for antigen. Unlike T cells, however, 
many individual naive B cells only produced a 
single type of effector cell, which was associated 
with Bim-mediated apoptosis. This situation could 
come about because naive B cells are biased toward 
the production of only one subset (9). The progeny 
of different clones may then experience different 
levels of trophic signals from T cells or cytokine 
receptors. Some clonal populations may prema- 
turely stop receiving these trophic signals, result- 
ing in apoptosis of some of their members and 
cessation of further differentiation. Other cells 
that continued to receive trophic signals may be 
protected by these signals from apoptosis, allow- 
ing further proliferation and the generation of 
additional effector cell subsets. Together, the com- 
bination of extrinsic heterogeneity in trophic sig- 
nals and intrinsic heterogeneity in BCRs expressed 


by the population of naive B cells specific for an 
antigen ensures that a diverse set of effector cells 
types is produced during the primary response. 
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In celebration of our 40th anniversary, New England Biolabs recently 
introduced the Passion in Science Awards™, recognizing those within 
the scientific community working to solve many of today’s challenges. 
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event. Award winners visited NEB’s campus in Ipswich, MA, for two 
days of discussions, presentations, tours and an awards celebration. 


2014 Passion in Science Award winners pictured with their hosts outside New England Biolabs’ facility in Ipswich, MA. 


Learn more about our award winners and their inspiring projects 
in our latest video at www.neb.com/PassionInScience 
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Japan prize selection criteria 

The Japan Prize is awarded to scientists and engineers from 
around the world who are recognized as having achieved 
original and dramatic accomplishments that greatly enhance 
the progress of science and technology, thereby contributing 
to the peace and prosperity of mankind. 
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Engagement 

Science is a global endeavor that advances when knowledge 
is both generated and shared. Increasingly, scientists and 
engineers are working both within and outside of national 
boundaries on local and global issues. 

Challenges necessitating innovation and international 
scientific collaboration are abundant in food and water 
security, sustainable development, infectious disease 
and health, climate change, natural disasters, and energy. 
Countries with varying levels of development, education, 
and scientific capacity may have different goals and 
expectations for international scientific engagement. 

What elements make international collaboration successful 
and sustainable? What engagement opportunities are 
available, and what are the responsibilities of researchers, 
entrepeneurs, educators, and policymakes in global 
scientific endeavors? 
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Greying the Wheels 

Lipids have traditionally been among the hardest biomolecules 
to study, but new technologies and techniques are gradually 
revealing more of the lipidome. By Alan Dove 

B iology is swaddled in lipids. Fats, oils, and waxes en- 
close cells and organelles, mediate vast networks of 
information flow, protect fragile tissues from hostile 
environments, and store essential energy for all kinds of organ- 
isms. Unfortunately, lipid structures don’t rely on the sorts of 
well-defined building blocks and simple rules that govern DNA, 
RNA, and proteins. From the complete DNA sequences of the 
biosynthetic genes in a cell, for example, one can reliably pre- 
dict the RNA transcripts likely to arise from those genes and 
the protein sequences they’ll encode; the lipids synthesized by 
those proteins, though, will remain mostly mysterious. 

This disjunction, between the relatively predictable patterns of 
nucleic acids and proteins and the largely inscrutable world of 
lipids, has skewed the progress of biochemistry for years. Ge- 
nome sequencing technology and genomics quickly fed the rise 
transcriptomics and proteomics. As those fields charged ahead, 
amassing terabytes of data on the biology of RNA and proteins 
respectively, lipid biochemists felt largely left out. 

That began to change in 2003, as a major National Institutes 
of Health (NIH)-funded effort set out to establish the field of 
lipidomics. In the ensuing decade, researchers and equipment 
makers have developed several new techniques and tools for 
analyzing these challenging molecules. 

The fat of the land 

“In 2002 ... there were no publications using the word ‘lipi- 
domics,”’ says Edward Dennis, Ph.D., professor of chemistry, 
biochemistry, and pharmacology at the University of California 


in San Diego. That year, Dennis proposed a project to create 
the field, and the NIH’s National Institute of General Medical 
Sciences (NIGMS) agreed to fund it. The result was the 10-year, 

$73 million Lipid Metabolites and Pathways Strategy (Lipid 
MAPS) effort, which Dennis oversaw. 

Dennis started by recruiting five other researchers at institu- 
tions across the country, each an expert on a particular type of 
lipid. The scientists began developing techniques to isolate and 
quantify their assigned lipid categories from biological samples, 
but they ran into a major problem immediately. “We came to 
realize that there was no classification for lipids that was suit- 
able for the bioinformatics age,” says Dennis, adding that “you 
need a taxonomy to classify them and ... store all the billions of 
bits of information we were intending to get.” 

To answer that need, the team invented a new classification 
scheme, dividing all lipids into eight major categories, six of 
which are found in mammals. The group also revised the no- 
menclature for lipid structures and developed a standardized 
system for drawing them. The Lipid MAPS website now features 
a searchable database with more than 40,000 structures, all 
classified and drawn using the new system, which researchers 
worldwide have also adopted. “I would say virtually everybody 
in the world today who talks about a lipid uses this structural 
representation,” says Dennis. 

With the taxonomy in place, the Lipid MAPS researchers fo- 
cused on lipidomics protocols. All of the labs began with identi- 
cal equipment, based on the AB Sciex QTRAP 4000 Liquid 
Chromatography/Mass Spectrometry (LC/MS) system that was 
then considered the state of the art. Each scientist optimized 
fractionation methods to isolate one specific category of lipids, 
then added radioactively labeled standard compounds before 
performing mass spectrometry. The combination of fraction- 
ation and mass spectrometry identified the lipids in the sample, 
and the internal standards allowed precise quantitation of them. 
Combining the data from all six labs produced complete lipido- 
mic profiles of some samples that answered specific biological 
questions. Lipid MAPS also discovered a number of novel lip- 
ids, though that wasn’t their primary goal. 

In an era when genomics tools are a mouse-click away and 
most major universities run dedicated core labs for proteomics, 
the scale and intensity of the six-facility Lipid MAPS approach 
may seem out of reach for nonspecialists. It probably is. “It 
would be very expensive to set it up from scratch,” Dennis con- 
cedes. With the conclusion of the Lipid MAPS project, though, 
many of the individual participants have continued to keep their 
doors open. Dennis’s group, for example, offers its lipidomics 
services to other researchers through collaborative grants or 
recharge funding. o 

The techniques and technologies for lipidomics both evolved g 
during the decade-long project. Dennis’s lab now uses an AB | 

Sciex QTRAP 6500M— the successor to the 4000— combined ^ 

with an Acquity ultrahigh-performance liquid chromatography | 
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(UPLC) system from Wa- 
ters in Milford, Massachu- 
setts. Other Lipid MAPS 
groups, and the growing 
population of lipidomics 
researchers outside the 
project, have adopted 
other strategies and tools 
depending on the types of 
experiments they pursue. 

Wax on, wax off 

“There isn’t one method 
that people are using 
everywhere,” says Ruth 
Welti, Ph.D., director of 
the lipidomics core facil- 
ity at Kansas State Uni- 
versity in Manhattan, Kansas. One reason for the diversity of 
techniques is that lipidomics researchers fall into two separate 
camps: high precision and high throughput. High-precision proj- 
ects, most notably, focus on obtaining accurate measurements 
of the levels of specific types of lipids. High throughput experi- 
ments, in contrast, tend to process large numbers of samples 
looking for relative changes in lipid levels, “sort of more like a 
gene expression analysis, where the absolute amounts aren’t so 
important but the comparison across the samples is important,” 
says Welti. 

Welti is a practitioner of the high throughput approach. In- 
stead of fractionating samples and looking for specific types 
of lipids in each fraction, she and her colleagues infuse crude 
samples directly into a Waters triple-quadrupole mass spec- 
trometer. That allows the team to analyze thousands of samples 
for a single experiment, a useful capability for Welti’s large-scale 
studies on agriculturally important plant phenotypes. 

Besides enabling higher throughput, direct infusion also miti- 
gates one of the drawbacks of LC-coupled MS. Lipids in a sam- 
ple can affect one another’s ionization in the mass spectrometer. 
By separating the sample into distinct groups of lipids, LC 
sends particular subsets of molecules into the mass spectrom- 
eter in distinct pulses. “If you use LC, then at some points the 
percentage of particular lipids that are ionized might relate to 
what other lipids they’re going in with,” says Welti. By sending 
the entire sample through without fractionation, the concentra- 
tions of all of the lipids remain constant relative to each other. 

Direct infusion has its own weaknesses, of course, and re- 
searchers using the technique have to take steps to address 
them. For example, most high throughput experiments need 

2 quality control samples made from a pool of all of the samples 
I being analyzed. The pooled sample contains all of the lipids 

H that could show up in any experimental sample. “A big problem 

^ with mass spec in general is normalization of data across time 

^ on the same mass spec, across different machines, across dif- 
I ferent labs, so having reference standards that contain all of the 

3 compounds that are in your samples is really helpful and can 


increase the precision of 
the data a lot,” says Welti. 

Lipidomics in general, 
and direct infusion in par- 
ticular, can also be a messy 
business. “There isn’t any 
instrument that doesn’t 
need to be cleaned a lot if 
you’re [studying] lipids by 
direct infusion,” says Welti. 

Despite using different 
approaches, Welti and 
Dennis agree on the need 
for specialized lipidomics 
facilities. The combination 
of technical complexity and 
often high instrument main- 
tenance causes many core 
facilities to avoid lipidomics. As a result, Welti estimates that her 
group has collaborated with about 400 other laboratories in the 
past decade. “It’s a big need and people don’t have this kind of 
thing available to them,” she says. 

In addition to academic facilities such as Welti’s, at least one 
company, Avanti Polar Lipids in Alabaster, Alabama, also offers 
lipidomics services to the scientific community. Avanti worked 
on the Lipid MAPS project and now makes its facilities available 
for a fee. 

Don’t fear the hydrophobes 

As more researchers discover the potential of lipidomics, 
though, some are starting to do the work themselves rather 
than collaborate with a dedicated lipid facility. “It’s not only the 
Ed Dennises of the world who are doing lipidomics today,” says 
Fadi Abdi, Ph.D., senior global market manager for lipidomics, 
metabolomics, and imaging at AB Sciex in Framingham, Mas- 
sachusetts. Abdi adds that “there’s a transition in the market 
from the proteomics side to the metabolomics and lipidomics 
side.” 

Indeed, all of the major mass spectrometer makers now offer 
products specifically for lipidomics. At AB Sciex, that means 
catering to both the high throughput and high-precision markets 
with different mass spectrometry systems. For high throughput 
survey experiments, the company sells triple time-of-flight mass 
spectrometers that provide accurate mass information very 
quickly. “That allows us to do discovery type of work for lipids. 
At the same time, we have another set of products, which are 
the QTRAP platforms, which allows us to do more targeted 
types of work,” says Abdi. 

A few miles away in Waltham, Massachusetts, Thermo 
Scientific now offers several lipidomics systems built around 
the company’s Orbitrap mass spectrometry technology. 
Meanwhile Bruker, in Billerica, Massachusetts, caters to 
the lipidomics market with a unique combination of mass 
spectrometers, thin layer chromatography, and nuclear 
magnetic resonance (NMR) instruments. continued> 
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The diversity of technol- 
ogy on the market reflects 
the unusual challenges of 
lipidomics. Unlike proteins 
and nucleic acids, lipids in 
cells often come in different 
isomeric forms, with identi- 
cal compositions and mo- 
lecular weights but different 
structures. Even a perfectly 
accurate mass measure- 
ment can’t distinguish 
between isomers, so high- 
precision lipidomics studies 
often have to rely on addi- 
tional analytical techniques. 

Separating the samples 
by UPLC before the mass 
spectrometer, or analyzing 
them with methods such as 
NMR afterward, may help fill gaps in the data. 

Besides learning about the strengths and limitations of the 
many equipment configurations, researchers considering tak- 
ing up lipidomics should ask companies about training and 
user-friendliness. Most are happy to guide newcomers. “Even 
if you’re not an expert in lipid analysis, the way lipids fragment 
and the way that you measure them and detect them is quite 
defined,” says Baijit Ubhi, Ph.D., staff scientist in metabolomics 
and lipidomics applications at AB Sciex. Ubhi adds that “we 
have predefined methods [and] application specialists who are 
experts in that field who can get a user up and running fairly 
quickly.” 

Mass spectrometers inevitably come with software as well. 

In addition to the applications that run the machine, most 
manufacturers include data analysis packages with varying 
degrees of sophistication and flexibility. Software optimized for 
proteomics or metabolomics may be useless for lipidomics, so 
scientists who want a complete system should look for lipido- 
mics-specific data handling options. Advanced systems either 
include their own databases of lipids, or tie into the Lipid MAPS 
database to identify specific lipid species in a sample. 

Apps for fats 

Many lipidomics researchers use the software that came with 
their mass spectrometers only for initial data collection, prefer- 
ring to export the data to another program for analysis. For 
some experimenters, that means transferring it into a common 
spreadsheet program such as Microsoft Excel. Dedicated lipi- 
domics specialists, however, often prefer to develop their own 
software. 

Christer Ejsing, Ph.D., associate professor in the department 
of biochemistry and molecular biology at the University of 
Southern Denmark in Odense, Denmark, is one of the scien- 
tists who chose to write his own data analysis program. “If you 
have several hundreds of injections on your instrument and you 
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want to organize the data, 
you don’t want to do that in 
Excel,” says Ejsing. 

As a result, Ejsing and 
his colleagues created 
the Analysis of Lipid Ex- 
periments (ALEX) software 
package, which is specifi- 
cally designed for the type 
of high-precision lipidomics 
Ejsing’s lab pursues. Ejs- 
ing uses a small script to 
export data from his mass 
spectrometry software into 
a database format, then 
uses Tableau, a data visu- 
alization language originally 
developed for the banking 
industry, to create graphs. 

Typical of most software 

from academic researchers, ALEX is open source, free for other 
scientists to download, use, and modify. 

For those pursuing high throughput lipidomics, Welti’s soft- 
ware project, LipidomeDB, may be more useful. LipidomeDB is 
a web-based system that takes Microsoft Excel files of mass 
spectrometry data as inputs and exports data in the same for- 
mat. The freely accessible system has been quite popular with 
Welti’s collaborators. “A lot of people use it, if we run samples 
for people, [they] have the option of processing it themselves or 
having us process it for them,” she says. 

Several other lipidomics labs have developed their own analy- 
sis software as well. Each program is optimized for the lab that 
created it, but most are freely available to other researchers. At 
least one company. Premier Biosoft in Palo Alto, California, 
has also released a commercial stand-alone lipidomics data 
analysis application. 

The wide selection of data analysis tools is both a blessing 
and a curse. While researchers can likely find a ready-built ap- 
plication that will fit their own needs, there’s no standardization 
across the systems to ensure that they’ll handle data consis- 
tently. As a result, scientists working in lipidomics often like to 
export their raw mass spectrometer data to outlets such as the 
European Molecular Biology Laboratory’s MetaboLights data- 
base or Iowa State University’s Plant/Eukaryotic and Microbial 
Systems Resource. That allows other scientists to reanalyze the 
data with their own applications to make comparisons across 
systems. 

Analyzing lipids may never be as straightforward as studying 
nucleic acids or proteins, but lipidomics is clearly off to a strong 
start. A recent search of PubMed revealed nearly a thousand 
publications using the term— not bad for a field that didn’t exist 
1 2 years ago. 

Alan Dove is a science writer and editor based in Massachusetts. 
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The W. M. Keck Foundation has announced a 3-year project to understand Earth’s changing near- 
surface oxidation state and the rise of oxygen through deep time. Central to this objective is creation 
of an open-access Deep-Time Data Infrastructure (DTDI). The DTDI Project Science Manager (PSM) 
will coordinate and integrate the efforts of a team of Earth, life, and data scientists from 6 nodes. 
The PSM’s responsibilities will include: 

• Maintaining regular contact with teams from all 6 nodes (Harvard, Rutgers, Johns Hopkins, 
University of Arizona, RPI, and Carnegie); 

• Organizing annual meetings of all co-investigators; 

• Coordinating the development of varied data resources and ensuring that these resources are 
integrated into a user-friendly open-access interface; 

• Overseeing project budgets and expenditures and preparing regular progress reports and briefing 
materials; 

• Communicating with the press and engaging potential scientist users of the DTDI; 

• Serving as liaison with the National Science Foundation (especially EarthCube), related data 
infrastructure efforts at other institutions, and professional societies; 

• Exploring other funding opportunities, especially for the sustained maintenance and operations of 
the DTDI beyond the initial 3 -year period of Keck support; 

• Engaging in active research related to the development and implementation of the DTDI. 

The PSM must have a Ph.D. in Earth Sciences with at least one year of program management 
experience, as well as expertise in geobiology, geochemistry, and mineralogy/petrology. She/he must 
be familiar with concepts of Earth’s changing near-surface environments in deep time. Programming 
skills are not required, but familiarity with database development, management, and use are desirable. 
The full-time, 2. 5 -year position will be based at the Carnegie Institution’s Broad Branch Road campus 
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The Carnegie Institution of Washington is an Equal Opportunity Employer. 



Shrmers Hospitals 

for Children’^ 


Assistant Corporate 
Director of Research 
Shriners Hospitais for Chiidren 
internationai 


Shriners Hospitals for Children (SHC) invites 
applications for Assistant Corporate Director 
of Research Programs, at its headquarters in 
Tampa, Florida. 

The applicant will assist in the management of 
an extensive program in basic, translational, 
and clinical research in pediatric congenital 
orthopedic diseases, spinal cord injury and bums 
at its 22 hospitals and eight Research Centers, 
with an annual departmental budget exceeding 
$32 million. Responsibilities include organizing 
peer review of grant applications, grants 
administration, site visit reviews of Research 
Centers, budget and program planning and 
evaluation, and research facilities development. 

The applicant must have a Ph.D., M.D., or 
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UC San Diego 

Assistant, Associate, Full Professor, 
Child Neurology (10-917) 

(Ladder Rank, In Residence, Adjunct, 
Clinical X, HS Clinical) 

School of Medicine - Neurosciences 

The Department of Neurosciences (http://neurosciences.ucsd.edu/) 
at the University of California San Diego is committed to academic 
excellence and diversity within the faculty, staff, and student body. 

We seek an outstanding Developmental Neuroscientist for the 
neurology unit at UCSD/Rady Children’s Hospital San Diego. The 
successful candidate will be a neurologist-neuroscientist who is 
Board Certified in Child Neurology. The successful candidate will 
have a distinguished research program and be clinically active. We 
expect to appoint an individual who is currently at the Assistant, 
Associate, or Professor level. 

To view the full ad posting/application visit: apptrkr.com/578125 

AA-EOE:The University of California is an Equal Opportunity/Affirmative 
Action Employer. All qualified applicants will receive consideration for 
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Contacts: Applicants should submit your application materials in 
one package including (1) a cover letter, including your research 
achievements and future plan, (2) curriculum vitae, (3) reprints of 
major publications, (4) three letters of recommendation, solicited 
by the applicant. Please send application materials to Ms. Fan 
Lihua (1-5 Beichen West Road, Chaoyang District, Beijing 
100101, P.R., China. Emaii: fanih@ioz.ac.cn; Teiephone/Fax: 
+86-10-64807313. Further information is available at 
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Review of applications will begin Feb 15, 2015, and will continue 
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UNIVERSITY OF SOUTH ALABAMA 
College of Medicine 

Department of Physiology and Cell Biology 

The Department of Physiology and Cell Biology at 
the University of South Alabama College of Medicine 
is seeking to fill two research-intensive tenure-track 
ASSISTANT/ASSOCIATE PROFESSOR positions. 
Existing departmental research interests include cardio- 
vascular, pulmonary, neural, and mitochondrial biol- 
ogy, biomedical imaging, mechanics, and modeling. 
Successful candidates will have research programs that 
complement these ongoing efforts, and will contribute 
to the college’s graduate and medical education mis- 
sion. We are seeking individuals who are highly engaged 
and collaborative, with exceptional written and oral 
communication skills. Competitive applicants must have 
a Ph.D. and/or M.D. degree, postdoctoral experience, 
and demonstrated success in securing extramural grant 
support. Review of applications will begin immediate- 
ly and continue through the application deadline of 
April 1, 2015. Interested candidates must e-mail an 
application packet, including an introductory letter, 
complete curriculum vitae, summary of research and 
teaching interests (one-page limit), and contact infor- 
mation of three references to the Search Committee at 
e-mail: pcbsearch@southalabama.edu. 

Equal Opportunity Employer^Minorities /Females /Veterans / 
Persons with Disabilities. 
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Download 
your free copy 
today. 

ScienceCareers.org/booklets 


From technology specialists to patent 
attorneys to policy advisers, learn more 
about the types of careers that scientists 
can pursue and the skills needed in order 
to succeed in nonresearch careers. 
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Learn more and avoid driving your job search 
around in circles. 


• Search thousands of job postings 

• Create job alerts based on your criteria 

• Get career advice from our Career Forum experts 

• Download career advice articles and webinars 

• Complete an individual development plan at “myIDP” 

Target your job search using relevant resources 

on ScienceCareers.org. 
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WORKING LIFE 


By Eli Kintisch 


The science of schmoozing 

W atching Hugues Lantuit work the cocktail hour at Arctic Change, a Canadian research con- 
ference held in Ottawa in December, is a master class in networking. He’s gracious. (“It 
is very nice to meet you,” he tells a graduate student, shaking with two hands.) He’s self- 
deprecating. (“I’m sure I have no shot at that job.”) He’s a little coarse. (“I tried for half an 
hour to tie this f***ing bow tie; can you help me?”) He is constantly introducing one per- 
son to another. “Ah, you work at the Yukon Research Centre,” he says to the stranger he’s 
persuaded to tie his bow tie, as a crowd forms. “We are actually collaborating on a multimillion-dollar 
grant proposal. So if we get it, I will remember this moment as a formative step.” Laughter all around. 


In the burgeoning world of Arctic 
research, nearly everyone knows 
Lantuit, a professor at the Alfred 
Wegener Institute (AWI) in Pots- 
dam, Germany At age 37, he’s lever- 
aged his deep network of contacts 
to build an impressive career. In 
2006, he co-founded the Associa- 
tion of Polar Early Career Scientists 
(APECS), which now has more than 
5000 members. Five years later, 
he became co-principal investiga- 
tor on a 5-year, €9 million project 
to study permafrost in the Arctic. 

He serves on major international 
research committees. The research 
program he manages, on Herschel 
Island in the Canadian Beaufort 
Sea, is at an important site for mon- 
itoring erosion of Arctic permafrost 
coasts, which make up a third of 
Earth’s shores. “He’s the man,” says 
colleague Paul Overduin, a geosci- 
entist at AWI. “He puts us on the map in a lot of ways.” 

As early as graduate school, Lantuit says, he was “genu- 
inely interested in bringing people together.” As a master’s 
student at McGill University in 2002, he founded the Geog- 
raphy Graduate Society and became a key assistant on an 
international permafrost-mapping project. The experience 
taught him lessons about networking in science that have 
subsequently shaped his career. 

First, he says, what matters isn’t how impressive the title 
on a business card is— it’s the role you play in projects or 
initiatives. Second, while wearing a welcoming face never 
hurts, it’s what you offer scientifically that turns conversa- 
tions into scientific opportunities. 

Lantuit got connected early on, but he found few re- 
sources to help other young Arctic scientists do the same. 
So he and Jenny Baeseman, a microbial biologist, created 
APECS, which runs workshops that bring together young 


and established scientists to explore 
publishing, fieldwork, funding, and 
career advancement. His reputation 
as a connector grew as he matched 
colleagues with opportunities— 
including helping Baeseman, who 
was focused on Antarctica, join 
a project in the Arctic. “He’s just 
someone who’s constantly connect- 
ing his peers,” Baeseman says. 

They connect him in return: 
Lantuit’s CV lists the myriad part- 
nerships, international research 
projects, journal special issues, pan- 
els, and conferences he’s created or 
run. All that connecting takes time 
away from research, he acknowl- 
edges. “There are scientists who will 
publish 50 papers in Nature in their 
career. I won’t. I’m fine with that.” 
Yet, none of this would be possible 
if he hadn’t published well-regarded 
papers on erosion on Arctic coasts. “I 
tell my [Ph.D.] students, you need to deliver scientifically,” he 
says. Several of them wanted to attend the Ottawa meeting, 
but Lantuit said no. “Their dissertations are late.” 

Lantuit’s appetite for new projects and partnerships re- 
mains voracious. Managing his various ongoing efforts taxes 
his phone’s scheduling app, which lists a relentless string of 
meetings, dinners, sessions, and cocktail hours for the 5-day 
meeting. “I have this inner drive to connect,” he says as he 
gets up to walk to the conference’s gala dinner. He has as- 
sembled a colorful table of heavy hitters: influential scien- 
tists, well-known administrators, and funders. These are 
useful people to know, but it’s clear he genuinely likes each 
one. “It’s fun,” he says, and heads for the ballroom. ■ 


Eli Kintisch is a contributing correspondent for Science. For 
more on life and careers, visit www.sciencecareers.org. Send 
your story to SciCareerEditor@aaas.org. 



^He’s just someone who’s 
constantly connecting 
his peers.” 
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